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METHODS FOR IDENTIFYING SMALL MOLECULES THAT 
MODULATE PREMATURE TRANSLATION TERMINATION 

AND NONSENSE MEDIATED mRNA DECAY 



1. INTRODUCTION 

The present invention relates to a method for screening and identifying test 
compounds that modulate premature translation termination and/or nonsense-mediated 

1 0 messenger ribonucleic acid ("mRNA") decay by interacting with a preselected target 
ribonucleic acid ("RNA"). In particular, the present invention relates to identifying test 
compounds that bind to regions of the 28S ribosomal RNA ("rRNA") and analogs thereof- 
Direct, non-competitive binding assays are advantageously used to screen libraries of 
compounds for those that selectively bind to a preselected target RNA. Binding of target 

1 5 RNA molecules to a particular test compound is detected using any physical method that 
measures the altered physical property of the target RNA bound to a test compound. The 
methods of the present invention provide a simple, sensitive assay for high-throughput 
screening of libraries of compounds to identify pharmaceutical leads. 

20 2. BACKGROUND OF THE INVENTION 

Protein synthesis encompasses the processes of translation initiation, 
elongation, and termination, each of which has evolved to occur with great accuracy and has 
the capacity to be a regulated step in the pathway of gene expression. Recent studies, 
including those suggesting that events at termination may regulate the ability of ribosomes 

25 to recycle to the start site of the same mRNA, have underscored the potential of termination 
to regulate other aspects of translation. The RNA triplets UAA, UAG, and UGA are non- 
coding and promote translational termination. Termination starts when one of the three 
termination codons enters the A site of the ribosome signaling the polypeptide chain release 
factors to bind and recognize the termination signal. Subsequently, the ester bond between 

30 the 3' nucleotide of the transfer RNA ("tRNA") located in the ribosome's P site and the 
nascent polypeptide chain is hydrolyzed, the completed polypeptide chain is released, and 
the ribosome subunits are recycled for another round of translation. 

Nonsense-mediated mRNA decay is a surveillance mechanism that 
minimizes the translation and regulates the RNA stability of nonsense RNAs that contain 

35 chain termination mutations (see, e.g., Hentze & Kulozik, 1999, Cell 96:307-3 10; 

Culbertson, 1999, Trends in Genetics 15:74-80; Li & Wilkinson, 1998, Immunity 8:135- 
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141 ; and Ruiz-Echevarria et al, 1996, Trends in Biological Sciences, 21 :433-438). Chain 

termination mutations are caused when a base substitution or frameshift mutation changes a 

codon into a termination codon, i.e., a premature stop codon that causes translational 

termination. In nonsense-mediated mRNA decay, mRNAs with premature stop codons are 

frequently subject to degradation. A truncated protein is produced as a result of the 

translation apparatus prematurely terminating at the stop codon. 

Certain classes of known antibiotics have been characterized and found to 

interact with RNA. For example, the antibiotic thiostrepton binds tightly to a 60-mer from 

ribosomal RNA (Cundliffe et al, 1990, in The Ribosome: Structure, Function & Evolution 

(Schlessinger et al, eds.) American Society for Microbiology, Washington, D.C. pp. 479- 

490). Bacterial resistance to various antibiotics often involves methylation at specific rRNA 

sites (Cundliffe, 1989, Ann. Rev. Microbiol. 43:207-233). Aminoglycosidic aminocyclitol 

(aminoglycoside) antibiotics and peptide antibiotics are known to inhibit group I intron 

splicing by binding to specific regions of the RNA (von Ahsen et al , 1 99 1 , Nature 

(London) 353 :368-370). Some of these same aminoglycosides have also been found to 

inhibit hammerhead ribozyme function (Stage et al , 1995, RNA 1 :95-l 01). In addition, 

certain aminoglycosides and other protein synthesis inhibitors have been found to interact 

with specific bases in 16S rRNA (Woodcock etal, 1991, EMBO J. 10:3099-3103). An 

oligonucleotide analog of the 16S rRNA has also been shown to interact with certain 

aminoglycosides (Purohit et al, 1994, Nature 370:659-662). A molecular basis for 

hypersensitivity to aminoglycosides has been found to be located in a single base change in 

mitochondrial rRNA (Hutchin et al, 1993, Nucleic Acids Res. 21:4174-4179). 

Aminoglycosides have also been shown to inhibit the interaction between specific structural 

2 — RNA motifs and the corresponding RNA binding protein. Zapp et al (Cell, 1993, 74:969- 

978) has demonstrated that the aminoglycosides neomycin B, lividomycin A, and 

tobramycin can block the binding of Rev, a viral regulatory protein required for viral gene 

expression, to its viral recognition element in the IIB (or RRE) region of HIV RNA. This 

blockage appears to be the result of competitive binding of the antibiotics directly to the 

RRE RNA structural motif. 

30 

Aminoglycosides have also been found to promote nonsense suppression 
(see, e.g., Bedweil etal, 1997, Nat. Med. 3:1280-1284 and Howard etal, 1996, Nat. Med. 
2:467-469). Nonsense mutations cause approximately 10 to 30 percent of the individual 
cases of virtually all inherited diseases. Although nonsense mutations inhibit the synthesis 
of a full-length protein to one percent or less of wild-type levels, minimally boosting the 
expression levels of the full-length protein to between five and fifteen percent of normal 
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levels can eliminate or greatly reduce the severity of the disease. Clinical approaches that 
target the translation termination event to promote nonsense suppression have recently been 
described for model systems of cystic fibrosis and muscular dystrophy. Gentamicin is an 
aminoglycoside antibiotic that causes translational misreading and allowed the insertion of 

5 an amino acid at the site of the nonsense codon in models of cystic fibrosis, Hurlers 
Syndrome, and muscular dystrophy (see, e.g., Barton-Davis et aL, 1999, J. Clin. Invest 
104:375-381). These results strongly suggest that drugs that promote nonsense suppression 
by altering translation termination efficiency of a premature termination codon can be 
therapeutically valuable in the treatment of diseases caused by nonsense mutations. 

10 Citation or identification of any reference in Section 2 of this application is 

not an admission that such reference is available as prior art to the present invention. 

3. SUMMARY Q 1? THF. INVENTION 
The present invention relates to methods for identifying compounds that 
modulate translation termination and/or nonsense-mediated mRNA decay by identifying 
compounds that bind to preselected target elements of nucleic acids including, but not 
limited to, specific RNA sequences, KNA structural motifs, and/or RNA structural 
elements. In particular, the present invention relates to identifying test compounds that bind 
to regions of the 28S rRNA and analogs thereof. The specific target RNA sequences, RNA 
structural motifs, and/or RNA structural elements (i.e., regions of the 28S rRNA and 
analogs thereof) are used as targets for screening small molecules and identifying those that 
directly bind these specific sequences, motifs, and/or structural elements. For example, 
methods are described in which a preselected target RNA having a detectable label is used 
to screen a library of test compounds, preferably under physiologic conditions. Any 
complexes formed between the target RNA and a member of the library are identified using 
methods that detect the labeled target RNA bound to a test compound. In particular, the 
present invention relates to methods for using a target RNA having a detectable label to 
screen a bead-based library of test compounds. Compounds in the bead-based library that 
bind to the labeled target RNA will form a bead-based detectably labeled complex, which 
can be separated from the unbound beads and unbound target RNA in the liquid phase by a 
number of physical means, including, but not limited to, flow cytometry, affinity 
chromatography, manual batch mode separation, suspension of beads in electric fields, and 
microwave of the bead-based detectably labeled complex. The detectably labeled complex 
can then be identified by the label on the target RNA and removed from the uncomplexed, 
35 unlabeled test compounds in the library. The structure of the test compound complexed 
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with the labeled RNA is then ascertained by de novo structure determination of the test 
compounds using, for example, mass spectrometry or nuclear magnetic resonance 
("NMR"). The test compounds identified are useful for any purpose to which a binding 
reaction may be put, for example in assay methods, diagnostic procedures, cell sorting, as 
inhibitors of target molecule function, as probes, as sequestering agents and the like. In 
addition, small organic molecules which interact specifically with target RNA molecules 
may be useful as lead compounds for the development of therapeutic agents. 

The methods described herein for the identification of compounds that 
directly bind to 28S rRNA are well suited for high-throughput screening. The direct 
binding method of the invention offers advantages over drug screening systems for 
competitors that inhibit the formation of naturally-occurring RNA binding proteimtarget 
RNA complexes; i.e., competitive assays. The direct binding method of the invention is 
rapid and can be set up to be readily performed, e.g., by a technician, making it amenable to 
high-throughput screening. The method of the invention also eliminates the bias inherent in 
the competitive drug screening systems, which require the use of a preselected host cell 
factor that may not have physiological relevance to the activity of the target RNA. Instead, 
the methods of the invention are used to identify any compound that can directly bind to 
28S rRNA preferably under physiologic conditions. As a result, the compounds so 
identified can inhibit the interaction of the target RNA with any one or more of the native 
host cell factors (whether known or unknown) required for activity of the RNA in vivo. 

The present invention may be understood more fully by reference to the 
detailed description and examples, which are intended to illustrate non-limiting 
embodiments of the invention. 

3.1. Definitions 

As used herein, a "target nucleic acid" refers to RNA, DNA, or a chemically 
modified variant thereof In a preferred embodiment, the target nucleic acid is RNA. A 
target nucleic acid also refers to tertiary structures of the nucleic acids, such as, but not 
limited to loops, bulges, pseudoknots, guanosine quartets and turns. A target nucleic acid 
also refers to RNA elements such as, but not limited to, 28S rRNA and structural analogs 
thereof, which are described in Section 5.1. Non-limiting examples of target nucleic acids 
are presented in Section 5.1 

As used herein, a "library" refers to a plurality of test compounds with which 
a target nucleic acid molecule is contacted. A library can be a combinatorial library, e.g., a 
collection of test compounds synthesized using combinatorial chemistry techniques, or a 
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collection of unique chemicals of low molecular weight (less than 1000 daltons) that each 
occupy a unique three-dimensional space. 

As used herein, a "label" or "detectable label" is a composition that is 
5 detectable, either directly or indirectly, by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. For example, useful labels include radioactive 
isotopes (e.g., 32 P, 35 S, and 3 H), dyes, fluorescent dyes, electron-dense reagents, enzymes 
and their substrates (e.g., as commonly used in enzyme-linked immunoassays, e.g, alkaline 
phosphatase and horse radish peroxidase), biotin, digoxigenin, or haptens and proteins for 
1Q which antisera or monoclonal antibodies are available. Moreover, a label or detectable 
moiety can include an "affinity tag" that, when coupled with the target nucleic acid and 
incubated with a test compound or compound library, allows for the affinity capture of the 
target nucleic acid along with molecules bound to the target nucleic acid. One skilled in the 
art will appreciate that a affinity tag bound to the target nucleic acids has, by definition, a 
1 5 complimentary ligand coupled to a solid support that allows for its capture. For example, 
useful affinity tags and complimentary ligands include, but are not limited to, 
biotin-streptavidin, complimentary nucleic acid fragments (e.g., oligo dT-oligo dA, oligo 
T-oligo A, oligo dG-oligo dC, oligo G-oIigo C), aptamer complexes, or haptens and 
proteins for which antisera or monoclonal antibodies are available. The label or detectable 
2Q moiety is typically bound, either covalently, through a linker or chemical bound, or through 
ionic, van der Waals or hydrogen bonds to the molecule to be detected. 

As used herein, a "dye" refers to a molecule that, when exposed to radiation, 
emits radiation at a level that is detectable visually or via conventional spectroscopic means. 
As used herein, a "visible dye" refers to a molecule having a chromophore that absorbs 
25 radiation in the visible region of the spectrum (/.e. , having a wavelength of between about 
400 run and about 700 run) such that the transmitted radiation is in the visible region and 
can be detected either visually or by conventional spectroscopic means. As used herein, an 
"ultraviolet dye" refers to a molecule having a chromophore that absorbs radiation in the 
ultraviolet region of the spectrum (i.e., having a wavelength of between about 30 nm and 
3Q about 400 nm). As used herein, an "infrared dye" refers to a molecule having a 

chromophore that absorbs radiation in the infrared region of the spectrum (i.e., having a 
wavelength between about 700 nm and about 3,000 nm). A "chromophore" is the network 
of atoms of the dye that, when exposed to radiation, emits radiation at a level that is 
detectable visually or via conventional spectroscopic means. One of skill in the art will 
35 readily appreciate that although a dye absorbs radiation in one region of the spectrum, it 
may emit radiation in another region of the spectrum. For example, an ultraviolet dye may 
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emit radiation in the visible region of the spectrum. One of skill in the art will also readily 
appreciate that a dye can transmit radiation or can emit radiation via fluorescence or 
phosphorescence. 

The phrase "pharmaceutical^ acceptable salt(s)," as used herein includes but 
is not limited to salts of acidic or basic groups that may be present in test compounds 
identified using the methods of the present invention. Test compounds that are basic in 
nature are capable of forming a wide variety of salts with various inorganic and organic 
acids. The acids that can be used to prepare pharmaceutically acceptable acid addition salts 
of such basic compounds are those that form non-toxic acid addition salts, i.e. , salts 
containing pharmacologically acceptable anions, including but not limited to sulfuric, citric, 
maleic, acetic, oxalic, hydrochloride, hydrobromide, hydroiodide, nitrate, sulfate, bisulfate, 
phosphate, acid phosphate, isonicotinate, acetate, lactate, salicylate, citrate, acid citrate, 
tartrate, oleate, tannate, pantothenate, bitartrate, ascorbate, succinate, maleate, gentisinate, 
fumarate, gluconate, glucaronate, saccharate, formate, benzoate, glutamate, 
methanesulfonate, ethanesulfonate, benzenesulfonate, p-toluenesulfonate and pamoate (/.<?., 
l,l'-methylene-bis-(2-hydroxy-3-naphthoate)) salts. Test compounds that include an amino 
moiety may form phannaceutically or cosmetically acceptable salts with various amino 
acids, in addition to the acids mentioned above. Test compounds that are acidic in nature 
are capable of forming base salts with various pharmacologically or cosmetically acceptable 
cations. Examples of such salts include alkali metal or alkaline earth metal salts and, 
particularly, calcium, magnesium, sodium lithium, zinc, potassium, and iron salts. 

By "substantially one type of test compound," as used herein, is meant that 
the assay can be performed in such a fashion that at some point, only one compound need be 
used in each reaction so that, if the result is indicative of a binding event occurring between 
the target RNA molecule and the test compound the test compound, can be easily identified. 



4. DESCRIPTION OF DRAWINGS 

FIG. 1 . The human 28S rRNA. Domains II and V are circled. 

FIG. 2. Small molecules involved in nonsense suppression alter the chemical 

footprinting pattern in Domain V of the 28S rRNA. 100 pmol of ribosomes 
were incubated with lOOpM compound, followed by treatment with the 
chemical modifying agents kethoxal (KE) and dimethyl sulfate (DMS, not 
shown). Following chemical modification, rRNA was prepared and analyzed 
in primer extension reactions using end-labeled oligonucleotides hybridizing 
to rRNA. A sequencing reaction was run in parallel as a marker. 
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FIG. 3. Small molecules involved in nonsense suppression alter the chemical 

footprinting pattern in Domain V of the 28S rRNA. 1 00 pmol of ribosomes 
were incubated with 100 \xM compound, followed by treatment with the 
chemical modifying agents kethoxal (KE) and dimethyl sulfate (DMS, not 
shown). Following chemical modification, rRNA was prepared and analyzed 
in primer extension reactions using end-labeled oligonucleotides hybridizing 
to rRNA. A sequencing reaction was run in parallel as a marker. 

FIG. 4. Small molecules involved in nonsense suppression alter the chemical 

footprinting pattern in Domain II (GTPase Center) of the 28S rRNA. 100 
pmol of ribosomes were incubated with 100 nM compound, followed by 
treatment with the chemical modifying agents kethoxal (KE) and dimethyl 
sulfate (DMS, not shown). Following chemical modification, rRNA was 
prepared and analyzed in primer extension reactions using end-labeled 
oligonucleotides hybridizing to rRNA. A sequencing reaction was run in 
parallel as a marker. 

FIG. 5. Small molecules involved in nonsense suppression alter the chemical 

footprinting pattern of domain II of the 28S rRNA. 100 pmol of ribosomes 
were incubated with 100 jxM compound, followed by treatment with 
chemical modifying agents dimethyl sulfate (DMS) and kethoxal (KE). 
Following chemical modification, rRNA was prepared and analyzed in 
primer extension reactions using end-labeled oligonucleotides hybridizing to 
rRNA. A sequencing reaction was run in parallel as a marker. 

FIG. 6. A specific region of Domain II can compete for compound binding and 

prevents nonsense suppression in vitro. The in vitro nonsense suppression 
assay was performed using a luciferase construct with a UGA nonsense 
mutation. 0.1 \jM compound was present in the reaction to induce nonsense 
suppression. Competitor RNA corresponding to Domain II was added at the 
indicated concentrations (0, 1, 2.5, 5, 7.5, 10 pM) to titrate the small 
molecule and prevent nonsense suppression. 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods for identifying compounds that bind 
to preselected target elements of nucleic acids, in particular, RNAs, including but not 
limited to preselected target RNA sequencing structural motifs, or structural elements. In 
particular, the present invention relates to identifying test compounds that bind to regions of 
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the 28S rRNA and analogs thereof. Methods are described in which a preselected target 
RNA having a detectable label is used to screen a library of test compounds. Any 
complexes formed between the target RNA and a member of the library are identified using 
methods that detect the labeled target RNA bound to a test compound. In particular, the 
present invention relates to methods for using a target RNA having a detectable label to 
screen a bead-based library of test compounds. Compounds in the bead-based library that 
bind to the labeled target RNA will form a bead-based detectably labeled complex, which 
can be separated from the unbound target RNA in the liquid phase by a number of physical 
means, such as, but not limited to, flow cytometry, affinity chromatography, manual batch 
mode separation, suspension of beads in electric fields, and microwave of the bead-based 
detectably labeled complex. The detectably labeled complex can then be identified by the 
label on the target RNA and removed from the uncomplexed, unlabeled test compounds in 
the library. The structure of the test compound attached to the labeled RNA is then 
ascertained by de novo structure determination of the test compounds using, for example, 
mass spectrometry or nuclear magnetic resonance ("NMR")- 

Thus, the methods of the present invention provide a simple, sensitive assay 
for high-throughput screening of libraries of test compounds, in which the test compounds 
of the library that specifically bind a preselected target nucleic acid are easily distinguished 
from non-binding members of the library. The structures of the binding molecules are 
20 ascertained by de novo structure determination of the test compounds using, for example, 
mass spectrometry or nuclear magnetic resonance ("NMR"). The test compounds so 
identified are useful for any purpose to which a binding reaction may be put, for example in 
assay methods, diagnostic procedures, cell sorting, as inhibitors of target molecule function, 
as probes, as sequestering agents and lead compounds for development of therapeutics, and 
25 the like. Small organic compounds that are identified to interact specifically with the target 
RNA molecules are particularly attractive candidates as lead compounds for the 
development of therapeutic agents. 

The assay of the invention reduces bias introduced by competitive binding 
assays which require the identification and use of a host cell factor (presumably essential for 
30 modulating RNA function) as a binding partner for the target RNA. The assays of the 
present invention are designed to detect any compound or agent that binds to 28S rRNA, 
preferably under physiologic conditions. Such agents can then be tested for biological 
activity, without establishing or guessing which host cell factor or factors is required for 
35 modulating the function and/or activity of 28S rRNA. 
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Section 5.1 describes examples of 28S rRNA and analogs thereof that can be 
used as preselected target RNAs. Section 5.2 describes detectable labels for target nucleic 
acids that are useful in the methods of the invention. Section 5.3 describes libraries of test 
compounds. Section 5.4 provides conditions for binding a labeled target RN A to a test 
compound of a library and detecting RN A binding to a test compound using the methods of 
the invention. Section 5.5 provides methods for separating complexes of target RNAs 
bound to a test compound from an unbound RNA. Section 5.6 describes methods for 
identifying test compounds that are bound to the target RNA. Section 5.7 describes a 
secondary, biological screen of test compounds identified by the methods of the invention to 
test the effect of the test compounds in vivo. Section 5.8 describes the use of test 
compounds identified by the methods of the invention for treating or preventing a disease or 
abnormal condition in mammals. 



5.1. 28S rRNA and Analogs Thereof 

The ribosome is a 2.5-MDa ribonucleoprotein complex involved in the 
decoding of genetic material from mRNA to proteins. A combination of biophysical and 
biochemical analysis have provided three dimensional models of the ribosome as well as 
detailed analyses into the mechanism of the individual steps in translation (see, e.g., Green 
&Noller, 1997, Annu. Rev. Biochem. 66:679-716; Cate era/., 1999, Science 
285(5436):2095-2104; and Ban et al, 2000, Science.289(5481): 905-920.). 

The 28S rRNA is one of the ribosomal RNA components of the 60S subunit 
of eucaryotic ribosomes. The 28S rRNA sequences are conserved when expressed as 
mature rRNAs, although the 28S rRNA contains variable sequence tracts that are 
interspersed among conserved core sequences and lacking in the counterpart bacterial 23S 
rRNA (see, e.g., Hancock & Dover, 1988, Mol. Biol. Evol. 5:377-391). A diagram of the 
28S rRNA is presented in Figure 1, with domains II and V circled. As indicated in Figure 1, 
a GTPase center has been mapped to domain II and the peptidyl transferase center has been 

mapped to domain V. 

Compounds that interact in these regions or modulate local changes within 
these domains of the ribosome (e.g., alter base pairing interactions, base modification or 
modulate binding of trans-acting factors that bind to these regions) have the potential to 
modulate translation termination. These regions, i.e., domains II and V are conserved from 
prokaryotes to eukaryotes, but the role of these regions in modulating translation 
termination has not been realized in eukaryotes. In bacteria, when a short RNA fragment, 
complementary to the R coli 23S rRNA segment comprising nucleotides 735 to 766 (in 
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domain II), is expressed in vivo, suppression of UGA nonsense mutations, but not UAA or 
UAG, results (Chernyaeva et al. , 1999, J Bacteriol 181 -.5257-5262). Other regions of the 
23S rRNA in E. coli have been implicated in nonsense suppression including the GTPase 
center in domain H (nt 1 034-1 120; Jemiolo et al, 1995, Proc. Nat. Acad. Sci. 
92:12309-12313). 

Genetic mutations in bacteria have also identified rRNA mutations that 
either increase the level of frameshifting in the trpE or the suppression of a nonsense 
mutations in the trpA gene (reviewed in Green &Noller, 1997, Annu. Rev. Biochem. 
66:679-716). The frameshifting mutations mapped to domains IV and V of the 23S rRNA. 
Disruption of the interaction of the CCA end of the tRNA with the peptidyl transferase 
center of the ribosome has been demonstrated to result in an increased translational error 
frequency (reviewed in Green & Noller, 1997, Annu. Rev. Biochem. 66:679-716). 

Regions of the 28S rRNA involved in frameshifting, nonsense mutation 
suppression, GTPase activity, or peptidyl transferase are attractive target RNAs to identify 
compounds that modulate premature translation termination and/or nonsense mediated 
decay. The interference of atest compound with one or more of these functions could 
potentially mediate translation termination by interfering with premature translation 
termination. Without being bound by theory, a test compound could potentially mediate 
translation termination by causing readthrough of a premature translation codon, therefore 

20 

allowing the synthesis of the full-length protein. 

In a preferred embodiment, the target RNA comprises a region of 28S rRNA 
corresponding to domain n (see, e.g., nucleotides 1310 to 2333 of accession number 
Ml 1 167) or domain V of 28S rRNA (see, e.g., nucleotides 3859 to 4425 of accession 
number Ml 1 167) or an analog thereof. It will become apparent to one of skill in the art that 

25 an analog of the 28S rRNA has an analogous structure and function to native 28S rRNA. 
For example, an analog of human 28S rRNA includes, but is not limited to, a human 28S 
rRNAretropseudogene (see, e.g., Wang et al., 1997, Gene 196:105-111, Accession Number 
L20636). Regions corresponding to domain H or domain V of the 28S rRNA pseudogene 
could be used as target RNAs in the present invention. In a preferred embodiment, the 28S 

30 rRNA is a human 28S rRNA, although the teachings of the present invention are applicable 
to mammals. 

Synthesis of the target RNAs, i.e., regions of 28S rRNA, can be performed 
by methods known to one of skill in the art (see, e.g., Sambrook et al, 1989, Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
35 Harbor, New York and Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, 
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MRL Press, Ltd., Oxford, U.K. Vol. I, II). In a preferred embodiment, the target RNAs are 
cloned as DNAs downstream of a promoter, such as but not limited to T7, T3, or Sp6 
promoters, and in vitro transcribed with the corresponding polymerase. A detectable label 
can be incorporated into the in vitro transcribed RNA or alternatively, the target RNA is 
end-labeled (see Section 5.2 infra). Alternatively, the target RNA can be amplified by 
polymerase chain reaction with a primer containing an RNA promoter and subsequently in 
vitro transcribed, as described in U.S. Patent No. 6,271, 002, which is incorporated by 
reference in its entirety. 
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5.2. Tletectablv Labeled Tar get RNAs 
Target nucleic acids, including but not limited to RNA and DNA, useful in 
the methods of the present invention have a label that is detectable via conventional 
spectroscopic means or radiographic means. Preferably, target nucleic acids are labeled 
with a covalently attached dye molecule. Useful dye-molecule labels include, but are not 
limited to, fluorescent dyes, phosphorescent dyes, ultraviolet dyes, infrared dyes, and visible 
dyes. Preferably, the dye is a visible dye. 

Useful labels in the present invention can include, but are not limited to, 
spectroscopic labels such as fluorescent dyes (e.g., fluorescein and derivatives such as 
fluorescein isothiocyanate (FITC) and Oregon Green™, rhodamine and derivatives (e.g., 
Texas red, tetramethylrhodimine isothiocynate (TRITC), bora-3a,4a-diaza-s-indacene 
(BODIPY®) and derivatives, etc.), digoxigenin, biotin, phycoerythrin, AMCA, CyDye™, 
and the like), radiolabels (e.g., 3 H, ,25 1, 3S S, ,4 C, 32 P, 33 P, etc.), enzymes (e.g., horse radish 
peroxidase, alkaline phosphatase etc.), spectroscopic colorimetric labels such as colloidal 
gold or colored glass or plastic (e.g. polystyrene, polypropylene, latex, etc.) beads, or 
nanoparticles -nanoclusters of inorganic ions with defined dimension from 0.1 to 1000 nm. 
The label may be coupled directly or indirectly to a component of the detection assay (e.g., 
the detection reagent) according to methods well known in the art. A wide variety of labels 
may be used, with the choice of label depending on sensitivity required, ease of conjugation 
3Q with the compound, stability requirements, available instrumentation, and disposal 
provisions. 

In one embodiment, nucleic acids that are labeled at one or more specific 
locations are chemically synthesized using phosphoramidite or other solution or solid-phase 
methods. Detailed descriptions of the chemistry used to form polynucleotides by the 
phosphoramidite method are well known (see, e.g., Caruthers etal, U.S. Pat. Nos. 
35 4,458,066 and 4,415,732; Caruthers et al, 1982, Genetic Engineering 4:1-17; Users Manual 
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Model 392 and 394 Polynucleotide Synthesizers, 1990, pages 6-1 through 6-22, Applied 
Biosystems, PartNo. 901237; Ojwang, et al, 1997, Biochemistry, 36:6033-6045). The 
phosphoramidite method of polynucleotide synthesis is the preferred method because of its 
efficient and rapid coupling and the stability of the starting materials. The synthesis is 
5 performed with the growing polynucleotide chain attached to a solid support, such that 
excess reagents, which are generally in the liquid phase, can be easily removed by washing, 
decanting, and/or filtration, thereby eliminating the need for purification steps between 
synthesis cycles. 

The following briefly describes illustrative steps of a typical polynucleotide 
10 synthesis cycle using the phosphoramidite method. First, a solid support to which is 
attached a protected nucleoside monomer at its 3' terminus is treated with acid, e.g., 
trichloroacetic acid, to remove the 5'-hydroxyl protecting group, freeing the hydroxyl group 
for a subsequent coupling reaction. After the coupling reaction is completed an activated 
intermediate is formed by contacting the support-bound nucleoside with a protected 
15 nucleoside phosphoramidite monomer and a weak acid, e.g., tetrazole. The weak acid 
protonates the nitrogen atom of the phosphoramidite forming a reactive intermediate. 
Nucleoside addition is generally complete within 30 seconds. Next, a capping step is 
performed, which terminates any polynucleotide chains that did not undergo nucleoside 
addition. Capping is preferably performed using acetic anhydride and 1-methylimidazole. 
20 The phosphite group of the intemucleotide linkage is then converted to the more stable 
phosphotriester by oxidation using iodine as the preferred oxidizing agent and water as the 
oxygen donor. After oxidation, the hydroxyl protecting group of the newly added 
nucleoside is removed with a protic acid, e.g., trichloroacetic acid or dichloroacehc acid, 
and the cycle is repeated one or more times until chain elongation is complete. After 
25 synthesis, the polynucleotide chain is cleaved from the support using a base, e.g., 

ammonium hydroxide or t-butyl amine. The cleavage reaction also removes any phosphate 
protecting groups, e.g., cyanoethyl. Finally, the protecting groups on the exocyclic amines 
of the bases and any protecting groups on the dyes are removed by treating the 
polynucleotide solution in base at an elevated temperature, e.g., at about 55°C. Preferably 
30 the various protecting groups are removed using ammonium hydroxide or t-butyl amine. 

Any of the nucleoside phosphoramidite monomers can be labeled using 
standard phosphoramidite chemistry methods (Hwang et al, 1999, Proc. Natl. Acad. Sci. 
USA 96(23):12997-13002; Ojwang et al, 1997, Biochemistry. 36:6033-6045 and references 
cited therein). Dye molecules useful for covalently coupling to phosphoramidites preferably 
35 comprise a primary hydroxyl group that is not part of the dye's chromophore. Illustrative 
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dye molecules include, but are not limited to, disperse dye CAS 4439-31-0, disperse dye 
CAS 6054-58-6, disperse dye CAS 4392-69-2 (Sigma-Aldrich, St. Louis, MO), disperse 
red and 1-pyrenebutanol (Molecular Probes, Eugene, OR). Other dyes useful for coupling 
to phosphoramidites will be apparent to those of skill in the art, such as fluorescein, cy3, 
5 and cy5 fluorescent dyes, and may be purchased from, e.g. , Sigma-Aldrich, St. Louis, MO 
or Molecular Probes, Inc., Eugene, OR. 

In another embodiment, dye-labeled target RNA molecules are synthesized 
enzymatically using in vitro transcription (Hwang et al. , 1999, Proc. Natl. Acad. Sci. USA 
96(23):12997-13002 and references cited therein). In this embodiment, a template DNA is 
10 denatured by heating to about 90°C and an oligonucleotide primer is annealed to the 

template DNA, for example by slow-cooling the mixture of the denatured template and the 
primer from about 90°C to room temperature. A mixture of ribonucleoside-5 '-triphosphates 
capable of supporting template-directed enzymatic extension of the primed template (e.g., a 
mixture including OTP, ATP, CTP, and UTP), including one or more dye-labeled 
15 ribonucleotides (Sigma-Aldrich, St. Louis, MO), is added to the primed template. Next, a 
polymerase enzyme is added to the mixture under conditions where the polymerase enzyme 
is active, which are well-known to those skilled in the art A labeled polynucleotide is 
formed by the incorporation of the labeled ribonucleotides during polymerase-mediated 
strand synthesis. 

In yet another embodiment of the invention, nucleic acid molecules are end- 
labeled after their synthesis. Methods for labeling the 5'-end of an oligonucleotide include 
but are by no means limited to: (i) periodate oxidation of a 5 '-to-5' -coupled ribonucleotide, 
followed by reaction with an amine-reactive label (Heller & Morisson, 1985, m Rapid 
Detection and Identification of Infectious Agents, D.T. Kingsbury and S. Falkow, eds., pp. 

25 245-256, Academic Press); (ii) condensation of ethylenediamine with 5'-phosphorylated 
polynucleotide, followed by reaction with an amine reactive label (Morrison, European 
Patent Application 232 967); (iii) introduction of an aliphatic amine substituent usmg an 
aminohexyl phosphite reagent in solid-phase DNA synthesis, followed by reaction with an 
amine reactive label (Cardullo */ a/., 1988, Proc. Natl. Acad. Sci. USA 85:8790-8794); and 

30 (iv) introduction of a thiophosphate group on the 5'-end of the nucleic acid, using 
phosphatase treatment followed by end-labeling with ATP- S and kinase, which reacts 
specifically and efficiently with maleimide-labeled fluorescent dyes (Czworkowski et al, 

1991, Biochem. 30:4821-4830). 

A detectable label should not be incorporated into a target nucleic acid at the 
35 specific binding site at which test compounds are likely to bind, since the presence of a 
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covalently attached label might interfere stericaUy or chemically with the binding of the test 
compounds at this site. Accordingly, if the region of the target nucleic acid that binds to a 
host cell factor is known, a detectable label is preferably incorporated into the nucleic acid 
molecule at one or more positions that are spatially or sequentially remote from the binding 
region. 

After synthesis, the labeled target nucleic acid can be purified using standard 
techniques known to those skilled in the art (see Hwang et al., 1999, Proc. Natl. Acad. Sci. 
USA 96(23): 12997-13002 and references cited therein). Depending on the length of the 
target nucleic acid and the method of its synthesis, such purification techniques include, but 
are not limited to, reverse-phase high-performance liquid chromatography ("reverse-phase 
HPLC"), fast performance liquid chromatography ("FPLC"), and gel purification. After 
purification, the target RNA is refolded into its native conformation, preferably by heating 
to approximately 85-95°C and slowly cooling to room temperature in a buffer, e.g., a buffer 
comprising about 50 mM Tris-HCl, pH 8 and 100 mM NaCl. 

In another embodiment, the target nucleic acid can also be radiolabeled. A 
radiolabel, such as, but not limited to, an isotope of phosphorus, sulfur, or hydrogen, may be 
incorporated into a nucleotide, which is added either after or during the synthesis of the 
target nucleic acid. Methods for the synthesis and purification of radiolabeled nucleic acids 
are well known to one of skill in the art. See, e.g., Sambrook et al, 1989, in Molecular 
20 Cloning: A Laboratory Manual, pp 10.2-10.70, Cold Spring Harbor Laboratory Press, and 
the references cited therein, which are hereby incorporated by reference in their entireties. 

In another embodiment, the target nucleic acid can be attached to an 
inorganic nanoparticle. A nanoparticle is a cluster of ions with controlled size from 0.1 to 
1000 nm comprised of metals, metal oxides, or semiconductors including, but not limited to 
25 Ag 2 S, ZnS, CdS, CdTe, Au, or Ti0 2 . Nanoparticles have unique optical, electronic and 
catalytic properties relative to bulk materials which can be adjusted according to the size of 
the particle. Methods for the attachment of nucleic acids are well known to one of skill in 
the art (see, e.g., Niemeyer, 2001, Angew. Chem. Int Ed. 40: 4129-4158, International 
Patent Publication WO/0218643, and the references cited therein, the disclosures of which 
are hereby incorporated by reference in their entireties). 

5.3. libraries of Small Molecules 

Libraries screened using the methods of the present invention can comprise a 
variety of types of test compounds on solid supports. In all of the embodiments described 
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below, all of the libraries can be synthesized on solid supports or the compounds of the 
library can be attached to solid supports by linkers. 

In some embodiments, the test compounds are nucleic acid or peptide 
molecules. In a non-hmiting example, peptide molecules can exist in a phage display 
5 library. In other embodiments, types of test compounds include, but are not limited to, 

peptide analogs including peptides comprising non-naturaUy occurring amino acids, e.g., D- 
amino acids, phosphorous analogs of amino acids, such as a-amino phosphoric acids and a- 
amino phosphoric acids, or amino acids having non-peptide linkages, nucleic acid analogs 
such as phosphorothioates and PNAs, hormones, antigens, synthetic or naturally occurring 
10 drugs, opiates, dopamine, serotonin, catecholamines, thrombin, acetylcholine, 

prostaglandins, organic molecules, pheromones, adenosine, sucrose, glucose, lactose and 
galactose. Libraries of polypeptides or proteins can also be used. 

In a preferred embodiment, the combinatorial libraries are small organic 
molecule libraries, such as, but not limited to, benzodiazepines, isoprenoids, 
15 thiazolidinones, metathiazanones, pyrrolidines, morpholino compounds, and 

diazepindiones. In another embodiment, the combinatorial libraries comprise peptoids; 
random bio-oligomers; benzodiazepines; diversomers such as hydantoins, benzodiazepines 
and dipeptides; vinylogous polypeptides; nonpeptidal peptidomimetics; oligocarbamates; 
peptidyl phosphonates; peptide nucleic acid libraries; antibody libraries; or carbohydrate 
20 libraries. Combinatorial libraries are themselves commercially available (see, e.g., 
Advanced ChemTech Europe Ltd., Cambridgeshire, UK; ASINEX, Moscow Russia; 
BioFocus pic, Sittingbourne, UK; Bionet Research (A division of Key Organics Limited ), 
Camelford, UK; ChemBridge Corporation, San Diego, California; ChemDiv Inc, San 
Diego, California.; ChemRx Advanced Technologies, South San Francisco, California; 
25 ComGenex Inc., Budapest, Hungary; Evotec OAI Ltd, Abingdon, UK; IF LAB Ltd., Kiev, 
Ukraine; Maybridge pic, Cornwall, UK; PharmaCore, Inc., North Carolina; SIDDCO Inc, 
Tucson, Arizona; TimTec Inc, Newark, Delaware; Tripos Receptor Research Ltd, Bude, 

UK; Toslab, Ekaterinburg, Russia). 

In one embodiment, the combinatorial compound library for the methods of 
30 the present invention may be synthesized. There is a great interest in synthetic methods 

directed toward the creation of large collections of small organic compounds, or libraries, 

which could be screened for pharmacological, biological or other activity (Dolle, 2001, J. 

Comb. Chem. 3:477-517; Hall et aL, 2001, ibid. 3:125-150; Dolle, 2000, ibid 2:383-433; 

Dolle, 1999, ibid 1:235-282). The synthetic methods applied to create vast combinatorial 
35 libraries are performed in solution or in the solid phase, on a solid support. Solid-phase 
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synthesis makes it easier to conduct multi-step reactions and to drive reactions to 
completion with high yields because excess reagents can be easily added and washed away 
after each reaction step. Solid-phase combinatorial synthesis also tends to improve 
isolation, purification and screening. However, the more traditional solution phase 
chemistry supports a wider variety of organic reactions than solid-phase chemistry. 
Methods and strategies for the synthesis of combinatorial libraries can be found in A 
Practical Guide to Combinatorial Chemistry, AW. Czarnik and S.H. Dewitt, eds., 
American Chemical Society, 1997; The Combinatorial Index, B.A. Bunin, Academic Press, 
1998; Organic Synthesis on Solid Phase, F.Z. Dorwald, Wiley-VCH, 2000; and Solid-Phase 
Organic Syntheses, Vol. 1, A.W. Czarnik, ed., Wiley Interscience, 2001. 

Combinatorial compound libraries of the present invention may be 
synthesized using apparatuses described in US Patent No. 6,358,479 to Frisina et al, U.S. 
Patent No. 6,190,619 to Kilcoin et al, US Patent No. 6,132,686 to Gallup et al, US Patent 
No. 6,126,904 to Zuellig et al, US Patent No. 6,074,613 to Harness et al, US Patent No. 
6,054,100 to Stanchfield etal, and US Patent No. 5,746,982 to Saneii et al which are 
hereby incorporated by reference in their entirety. These patents describe synthesis 
apparatuses capable of holding a plurality of reaction vessels for parallel synthesis of 
multiple discrete compounds or for combinatorial libraries of compounds. 

In one embodiment, the combinatorial compound library can be synthesized 
20 in solution. The method disclosed in U.S. Patent No. 6,194,612 to Boger et al, which is 
hereby incorporated by reference in its entirety, features compounds useful as templates for 
solution phase synthesis of combinatorial libraries. The template is designed to permit 
reaction products to be easily purified from unreacted reactants using liquid/Uquid or 
solid/liquid extractions. The compounds produced by combinatorial synthesis using the 
25 template will preferably be small organic molecules. Some compounds in the library may 
mimic the effects of non-peptides or peptides. In contrast to solid-phase synthesis of 
combinatorial compound libraries, liquid-phase synthesis does not require the use of 
specialized protocols for monitoring the individual steps of a multistep solid-phase 
synthesis (Egner et al, 1995, J.Org. Chem. 60:2652; Anderson et al, 1995, J. Org. Chem. 
30 60:2650; Fitch et al, 1994, J. Org. Chem. 59:7955; Look et al, 1994, J. Org. Chem. 
49:7588; Metzger et al, 1993, Angew. Chem., Int. Ed. Engl. 32:894; Youngquist et al, 
1994, Rapid Commun. Mass Spect. 8:77; Chu et al, 1995, J. Am. Chem. Soc. 117:5419; 
Brummel et al, 1994, Science 264:399; Stevanovic et al, 1993, Bioorg. Med. Chem. Lett. 
3:431). 
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Combinatorial compound libraries useful for the methods of the present 
invention can be synthesized on solid supports. In one embodiment, a split synthesis 
method, a protocol of separating and mixing solid supports during the synthesis, is used to 
synthesize a library of compounds on solid supports (see Lam et al., 1997, Chem. Rev. 
97:41-448; Ohlmeyer et al, 1993, Proc. Natl. Acad. Sci. USA 90:10922-10926 and 
references cited therein). Each solid support in the final library has substantially one type of 
test compound attached to its surface. Other methods for synthesizing combinatorial 
libraries on solid supports, wherein one product is attached to each support, will be known 
to those of skill in the art (see, e.g., Nefzi et aL, 1997, Chem. Rev. 97:449-472 and US 
Patent No. 6,087,186 to Cargill et al. which are hereby incorporated by reference in their 
entirety). 

As used herein, the term "solid support" is not limited to a specific type of 
solid support. Rather a large number of supports are available and are known to one skilled 
in the art. Solid supports include silica gels, resins, derivatized plastic films, glass beads, 
cotton, plastic beads, polystyrene beads, doped polystyrene beads (as described by Fenniri et 
al, 2000, J. Am. Chem. Soc. 123:8151-8152), alumina gels, and polysaccharides. A 
suitable solid support may be selected on the basis of desired end use and suitability for 
various synthetic protocols. For example, for peptide synthesis, a solid support can be a 
resin such as p-methylbenzhydrylamine (pMBHA) resin (Peptides International, Louisville, 
20 KY), polystyrenes (e.g. , PAM-resin obtained from Bachem Inc., Peninsula Laboratories, 
etc.), including chloromethylpolystyrene, hydroxymethylpolystyrene and 
aminomethylpolystyrene, poly (dimethylacrylamide)-grafted styrene co-divinyl-benzene 
(e.g., POLYHIPE resin, obtained from Aminotech, Canada), polyamide resin (obtained 
from Peninsula Laboratories), polystyrene resin grafted with polyethylene glycol (e.g., 
TENTAGEL or ARGOGEL, Bayer, Tubingen, Germany) polydimethylacrylamide resin 
(obtained from Milligen/Biosearch, California), or Sepharose (Pharmacia, Sweden). In 
another embodiment, the solid support can be a magnetic bead coated with streptavidin, 
such as Dynabeads Streptavidin (Dynal Biotech, Oslo, Norway). 

In one embodiment, the solid phase support is suitable for in vivo use, i.e., it 
can serve as a carrier or support for administration of the test compound to a patient (e.g., 
TENTAGEL, Bayer, Tubingen, Germany). In a particular embodiment, the solid support is 
palatable and/or orally ingestable. 

In some embodiments of the present invention, compounds can be attached 
to solid supports via linkers. Linkers can be integral and part of the solid support, or they 
35 may be nonintegral that are either synthesized on the solid support or attached thereto after 
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synthesis. Linkers are useful not only for providing points of test compound attachment to 
the solid support, but also for allowing different groups of molecules to be cleaved from the 
solid support under different conditions, depending on the nature of the linker. For 
example, linkers can be, inter alia, electrophilically cleaved, nucleophilically cleaved, 
photocleavable, enzymatically cleaved, cleaved by metals, cleaved under reductive 
conditions or cleaved under oxidative conditions. 

In another embodiment, the combinatorial compound libraries can be 
assembled in situ using dynamic combinatorial chemistry as described in European Patent 
Application 1,118,359 Al to Lehn; Hue & Nguyen, 2001, Comb. Chem. High Throughput. 
Screen. 4:53-74; Lehn and Eliseev, 2001, Science 291:2331-2332; Cousins et al 2000, 
Curr. Opin. Chem. Biol. 4: 270-279; and Karan & Miller, 2000, Drug. Disc. Today 5:67-75 
which are incorporated by reference in their entirety. 

Dynamic combinatorial chemistry uses non-covalent interaction with a target 
biomolecule, including but not limited to a protein, RNA, or DNA, to favor assembly of the 
most tightly binding molecule that is a combination of constituent subunits present as a 
mixture in the presence of the biomolecule. According to the laws of thermodynamics, 
when a collection of molecules is able to combine and recombine at equilibrium through 
reversible chemical reactions in solution, molecules, preferably one molecule, that bind 

20 most tightly to a templating biomolecule will be present in greater amount than all other 
possible combinations. The reversible chemical reactions include, but are not limited to, 
imine, acyl-hydrazone, amide, acetal, or ester formation between carbonyl-containing 
compounds and amines, hydrazines, or alcohols; thiol exchange between disulfides; alcohol 
exchange in borate esters; Diels-Alder reactions; thermal- or photoinduced sigmatropic or 

^ ^ electrocyclic rearrangements; or Michael reactions. 

In the preferred embodiment of this technique, the constituent components of 
the dynamic combinatorial compound library are allowed to combine and reach equilibrium 
in the absence of the target RNA and then incubated in the presence of the target RNA, 
preferably at physiological conditions, until a second equilibrium is reached. The second, 

3Q perturbed, equilibrium (the so-called "templated mixture") can, but need not necessarily, be 
fixed by a further chemical transformation, including but not limited to reduction, oxidation, 
hydrolysis, acidification, or basification, to prevent restoration of the original equilibrium 
when the dynamical combinatorial compound library is separated from the target RNA. 

In the preferred embodiment of this technique, the predominant product or 

^ products of the templated dynamic combinatorial library can separated from the minor 
products and directly identified. In another embodiment, the identity of the predominant 
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product or products can be identified by a deconvolution strategy involving preparation of 
derivative dynamic combinatorial libraries, as described in European Patent Application 
1,118,359 Al, which is incorporated by reference in its entirety, whereby each component 
of the mixture is, preferably one-by-one but possibly group- wise, left out of the mixture and 
5 the ability of the derivative library mixture at chemical equilibrium to bind the target RNA 
is measured. The components whose removal most greatly reduces the ability of the 
derivative dynamic combinatorial library to bind the target RNA are likely the components 
of the predominant product or products in the original dynamic combinatorial library. 

10 5.4. T,ihrar y Screening 

After a target nucleic acid, such as but not limited to RNA or DNA, is 
labeled and a test compound library is synthesized or purchased or both, the labeled target 
nucleic acid is used to screen the library to identify test compounds that bind to the nucleic 
acid. Screening comprises contacting a labeled target nucleic acid with an individual, or 
1 5 small group, of the components of the compound library. Preferably, the contacting occurs 
in an aqueous solution, and most preferably, under physiologic conditions. The aqueous 
solution preferably stabilizes the labeled target nucleic acid and prevents denaturation or 
degradation of the nucleic acid without interfering with binding of the test compounds. The 
aqueous solution can be similar to the solution in which a complex between the target RNA 
20 and its corresponding host cell factor is formed in vitro. For example, TK buffer, which is 
commonly used to form Tat protein-TAR RNA complexes in vitro, can be used in the 
methods of the invention as an aqueous solution to screen a library of test compounds for 
TAR RNA binding compounds. 

The methods of the present invention for screening a library of test 
compounds preferably comprise contacting a test compound with a target nucleic acid in the 
presence of an aqueous solution, the aqueous solution comprising a buffer and a 
combination of salts, preferably approximating or mimicking physiologic conditions. The 
aqueous solution optionally further comprises non-specific nucleic acids, such as, but not 
limited to, DNA; yeast tRNA; salmon sperm DNA; homoribopolymers such as, but not 
limited to, poly IC, polyA, polyU, and polyC; and non-specific RNA. The non-specific 
RNA may be an unlabeled target nucleic acid having a mutation at the binding site, which 
renders the unlabeled nucleic acid incapable of interacting with a test compound at that site. 
For example, if dye-labeled TAR RNA is used to screen a library, unlabeled TAR RNA 
having a mutation in the uracil 23/cytosine 24 bulge region may also be present in the 
aqueous solution. Without being bound by any theory, the addition of unlabeled RNA that 

•» 
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is essentially identical to the dye-labeled target RNA except for a mutation at the binding 
site might ininimize interactions of other regions of the dye-labeled target RNA with test 
compounds or with the solid support and prevent false positive results. 

The solution further comprises a buffer, a combination of salts, and 
5 optionally, a detergent or a surfactant. The pH of the solution typically ranges from about 5 
to about 8, preferably from about 6 to about 8, most preferably from about 6.5 to about 8. A 
variety of buffers may be used to achieve the desired pH. Suitable buffers include, but are 
not limited to, Tris, Mes, Bis-Tris, Ada, Aces, Pipes, Mopso, Bis-Tris propane, Bes, Mops, 
Tes, Hepes, Dipso, Mobs, Tapso, Trizma, Heppso, Popso, TEA, Epps, Tricine, Gly-Gly, 
10 Bicine, and sodium-potassium phosphate. The buffering agent comprises from about 10 
mM to about 100 mM, preferably from about 25 mM to about 75 mM, most preferably from 
about 40 mM to about 60 mM buffering agent. The pH of the aqeuous solution can be 
optimized for different screening reactions, depending on the target RNA used and the types 
of test compounds in the library, and therefore, the type and amount of the buffer used in the 
1 5 solution can vary from screen to screen. In a preferred embodiment, the aqueous solution 
has a pH of about 7.4, which can be achieved using about 50 mM Tris buffer. 

In addition to an appropriate buffer, the aqueous solution further comprises a 
combination of salts, from about 0 mM to about 100 mM KC1, from about 0 mM to about 1 
M NaCl, and from about 0 mM to about 200 mM MgCl 2 . In a preferred embodiment, the 
20 combination of salts is about 100 mM KC1, 500 mM NaCl, and 10 mM MgCl 2 . Without 
being bound by any theory, Applicant has found that a combination of KC1, NaCl, and 
MgCl 2 stabilizes the target RNA such that most of the RNA is not denatured or digested 
over the course of the screening reaction. The optional concentration of each salt used in 
the aqueous solution is dependent on the particular target RNA used and can be determined 
using routine experimentation. 

The solution optionally comprises from about 0.01% to about 0.5% (w/v) of 
a detergent or a surfactant Without being bound by any theory, a small amount of detergent 
or surfactant in the solution might reduce non-specific binding of the target RNA to the 
solid support and control aggregation and increase stability of target RNA molecules. 
30 Typical detergents useful in the methods of the present invention include, but are not 

limited to, anionic detergents, such as salts of deoxycholic acid, 1-heptanesulfonic acid, N- 
laurylsarcosine, lauryl sulfate, 1-octane sulfonic acid and taurocholic acid; canonic 
detergents such as benzalkonium chloride, cetylpyridinium, methylbenzethonium chloride, 
and decamethonium bromide; zwitterionic detergents such as CHAPS, CHAPSO, alkyl 
35 betaines, alkyl amidoalkyl betaines, N-dodecyl-N,N-dimethyl-3-ammonio-l - 
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propanesulfonate, and phosphatidylcholine; and non-ionic detergents such as n-decyl a-D- 
glucopyranoside, n-decyl B-D-maltopyranoside, n-dodecyl B-D-maltoside, n-octyl B-D- 
glucopyranoside, sorbitan esters, n-tetradecyl B-D-maltoside, octylphenoxy 
polyethoxyethanol (Nonidet P-40), nonylphenoxypolyethoxyethanol (NP-40), and tritons. 
Preferably, the detergent, if present, is a nonionic detergent. Typical surfactants useful in 
the methods of the present invention include, but are not limited to, ammonium lauryl 
sulfate, polyethylene glycols, butyl glucoside, decyl glucoside, Polysorbate 80, lauric acid, 
myristic acid, palmitic acid, potassium palmitate, undecanoic acid, lauryl betaine, and lauryl 
alcohol. More preferably, the detergent, if present, is Triton X-100 and present in an 
amount of about 0.1% (w/v). 

Non-specific binding of a labeled target nucleic acid to test compounds can 
be further minimized by treating the binding reaction with one or more blocking agents. In 
one embodiment, the binding reactions are treated with a blocking agent, e.g. , bovine serum 
albumin ("BSA")> before contacting with to the labeled target nucleic acid. In another 
embodiment, the binding reactions are treated sequentially with at least two different 
blocking agents. This blocking step is preferably performed at room temperature for from 
about 0.5 to about 3 hours. In a subsequent step, the reaction mixture is further treated with 
unlabeled RNA having a mutation at the binding site. This blocking step is preferably 
performed at about 4°C for from about 12 hours to about 36 hours before addition of the 
dye-labeled target RNA. Preferably, the solution used in the one or more blocking steps is 
substantially similar to the aqueous solution used to screen the library with the dye-labeled 
target RNA, e.g., in pH and salt concentration. 



compound is preferably maintained at 4°C for from about 1 day to about 5 days, preferably 
from about 2 days to about 3 days with constant agitation. To identify the reactions in 
which binding to the labeled target nucleic acid occurred, after the incubation period, bound 
from free compounds are determined using any of the methods disclosed in Section 4.5 
infra. 



immobilized on beads, the beads must then be separated from the unbound target RNA in 
the liquid phase. This can be accomplished by any number of physical means; e.g., 
sedimentation, centrifiigation. Thereafter, a number of methods can be used to separate the 
library beads that are complexed with the labeled target RNA from uncomplexed beads in 



Once contacted, the mixture of labeled target nucleic acid and the test 



5.5. Separation Methods for Screening Test Compounds 
After the labeled target RNA is contacted with the library of test compounds 
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order to isolate the test compound on the bead. Alternatively, mass spectroscopy and NMR 
spectroscopy can be used to simultaneously identify and separate beads complexed to the 
labeled target RNA from uncomplexed beads. 

5.5.1. Flow Cytometry 
In a preferred embodiment, the complexed and non-complexed target nucleic 
acids are separated by flow cytometry methods. Flow cytometers for sorting and examining 
biological cells are well known in the art; this technology can be applied to separate the 
labeled library beads from unlabeled beads. Known flow cytometers are described, for 
example, in U.S. Patent Nos. 4,347,935; 5,464,581; 5,483,469; 5,602,039; 5,643,796; and 
6,21 1,477; the entire contents of which are incorporated by reference herein. Other known 
flow cytometers are the FACS Vantage™ system manufactured by Becton Dickinson and 
Company, and the COPAS™ system manufactured by Union Biometrica. 

A flow cytometer typically includes a sample reservoir for receiving a 
biological sample. The biological sample contains particles (hereinafter referred to as 
"beads") that are to be analyzed and sorted by the flow cytometer. Beads are transported 
from the sample reservoir at high speed (>100 beads/second) to a flow cell in a stream of 
liquid "sheath" fluid. High-frequency vibrations of a nozzle that directs the stream to the 
flow cell causes the stream to partition and form ordered droplets, with each droplet 
containing a single bead. Physical properties of beads can be measured as they intersect a 
laser beam within the cytometer flow cell. As beads move one by one through the 
interrogation point, they cause the laser light to scatter and fluorescent molecules on the 
labeled beads (/.<?., beads complexed with labeled target RNA) become excited. 
Alternatively, if the target nucleic acid is labeled with an inorganic nanoparticle, the beads 
complexed with bound target nucleic acid can be distinguished not only by unique 
fluorescent properties but also on the basis of spectrometry properties (e.g. including but 
not limited to increased optical density due to the reduction of Ag + ions in the presence of 
gold nanoparticles (see, e.g., Taton et al Science 2000, 289: 1757-1760)). 

An appropriate detection system consisting of photomultipiier tubes, 
photodiodes or other devices for measuring light are focused onto the interrogation point 
where the properties are measured. In so doing, information regarding particle size (light 
scatter) and complex formation (fluorescence intensity) is obtained. Particles with the 
desired physical properties are then sorted by a variety of physical means. In one 
embodiment, the beads are sorted by an electrostatic method. To sort beads by an 
electrostatic method, the droplets containing the beads with the desired physical properties 
) 
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are electrically charged and deflected from the trajectory of uncharged droplets as they pass 
through an electrostatic field formed by two deflection plates held constant at a high 
electrical potential difference. In another embodiment, the beads are sorted by an 
g air-diverting method. To sort beads by an air-diverting method, the droplets containing the 
beads with the desired physical properties are deflected from their trajectory by a focused 
stream of forced air. Both of these embodiments cause the trajectory of beads with the 
desired physical properties to become changed, thereby sorting them from other beads. 
Accordingly, the beads complexed to the labeled target RNA can be collected in an 
j q appropriate collecting vessel. 

Thus, in one embodiment of the present invention, the complexed and 
non-complexed target nucleic acids are separated by flow cytometry methods. In a 
preferred embodiment, the target nucleic acid is labeled with a fluorescent label and the 
complexed and non-complexed target nucleic acids are separated by fluorescence activated 
cell sorting ("FACS"). Such methods are well known to one of skill in the art. 



35 



5.5.2, Affinity Chromatography 

In another embodiment of the invention, the target RNA can be labeled with 
biotin, an antigen, or a ligand. Library beads complexed to the target RNA can be separated 
from uncomplexed beads using affinity techniques designed to capture the labeled moiety 
on the target RNA. For example, a solid support, such as but not limited to, a column or a 
well in a microwell plate coated with avidin/streptavidin, an antibody to the antigen, or a 
receptor for the ligand can be used to capture or immobilize the labeled beads. Complexed 
RNA may or may not be irreversibly bound to the bead by a further transformation between 
the bound RNA and an additional moiety on the surface of the bead. Such linking methods 
include, but are not limited to: photochemical crosslinking between RNA and bead-bound 
molecules such as psoralen, thymidine or uridine derivates either present as monomers, 
oligomers, or as a partially complementary sequence; or chemical ligation by disulfide 
exchange, nitrogen mustards, bond formation between an electrophile and a nucleophile, or 
alkylating reagents. See, e.g., International Patent Publication WO/0146461, the contents of 
which are hereby incorporated by reference. The unbound library beads can be removed 
after the binding reaction by washing the solid phase. If the RNA is irreversibly bound to 
the bead, test compounds can be isolated from the bead following destruction of the bound 
RNA by preferably, but not limited to, enzymatic or chemical alkaline hydrolysis) 
degradation. The library beads bound to the solid phase can then be eluted with any 
solution that disrupts the binding between the labeled target RNA and the solid phase. Such 
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solutions include high salt solutions, low pH solutions, detergents, and chaotropic 
denaturants, and are well known to one of skill in the art.. In another embodiment, the test 
compounds can be eluted from the solid phase by heat. 

In one embodiment, the library of test compounds can be prepared on 
magnetic beads, such as Dynabeads Streptavidin (Dynal Biotech, Oslo, Norway). The 
magnetic bead library can then be mixed with the labeled target RNA under conditions that 
allow binding to occur. The separation of the beads from unbound target RNA in the liquid 
phase can be accomplished using a magnet. After removal of the magnetic field, the bead 
complexed to the labeled RNA may be separated from uncomplexed library beads via the 
label used on the target RNA; e.g. , biotinylated target RNA can be captured by 
avidin/streptavidin; target RNA labeled with antigen can be captured by the appropriate 
antibody; target RNA labeled with ligand can be captured using the appropriate 
immobilized receptor. The captured library bead can then be eluted with any solution that 
disrupts the binding between the labeled target RNA and the immobilized surface. Such 
solutions include high salt solutions, low pH solutions, detergents, and chaotropic 
denaturants, and are well known to one of skill in the art. Complexed RNA may or may not 
be irreversibly bound to the bead by a further transformation between the bound RNA and 
an additional moiety on the surface of the bead. Such linking methods include, but are not 

2Q limited to: photochemical crosslinking between RNA and bead-bound molecules such as 
psoralen, thymidine or uridine derivates either present as monomers, oligomers, or as a 
partially complementary sequence; or chemical ligation by disulfide exchange, nitrogen 
mustards, bond formation between an electrophile and a nucieophile, or alkylating reagents. 
See, e.g., International Patent Publication WO/0146461, the contents of which are hereby 

25 incorporated by reference. If the RNA is irreversibly bound to the bead, test compounds can 
be isolated from the bead following destruction of the bound RNA by enzymatic 
degradation including, but not limited to, ribonucleases A, U 2 , CL3, T„ Phy M, B. cereus or 
chemical degradation including, but not limited to, piperidine-promoted backbone cleavage 
of abasic sites (following treatment with sodium hydroxide, hydrazine, piperidine formate, 

3Q or dimethyl sulfate), or metal-assisted (e.g. nickel(H), cobalt(II), or iron(H)) oxidative 
cleavage. 

In another embodiment, the preselected target RNA can be labeled with a 
heavy metal tag and incubated with the library beads to allow binding of the test compounds 
to the target RNA. The separation of the labeled beads from unlabeled beads can be 
35 accomplished using a magnetic field. After removal of the magnetic field, the test 

compound can be eluted with any solution that disrupts the binding between the preselected 
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target RNA and the test compound. Such solutions include high salt solutions, low pH 
solutions, detergents, and chaotropic denaturants, and are well known to one of skill in the 
art. In another embodiment, the test compounds can be eluted from the solid phase by heat. 

5 

5.5.3* Manual Batch 

In one embodiment, a manual "batch" mode is used for separating 
complexed beads. To explore a bead-based library within a reasonable time period, the 
primary screens should be operated with sufficient throughput. To do this, the target 
nucleic acid is labeled with a dye and then incubated with the combinatorial library. An 
advantage of such an assay is the fast identification of active library beads by color change. 
In the lower concentrations of the dye-labeled target molecule, only those library beads that 
bind the target molecules most tightly are detected because of higher local concentration of 
the dye. When washed and plated into a liquid monolayer, colored beads are easily 
^ separated from non-colored beads with the aid of a dissecting microscope. One of the 
problems associated with this method could be the interaction between the red dye and 
library substrates. Control experiments using the dye alone and dye attached to mutant 
RNA sequences with the libraries are performed to eliminate this possibility. 



5.5.4. Suspension of Beads in Electric Fields 

In another embodiment of the invention, library beads bound to the target 
RNA can be separated from unbound beads on the basis of the altered charge properties due 
to RNA binding. In a preferred embodiment of this technique, beads are separated from 
unbound nucleic acid and suspended, preferably but not only, in the presence of an electric 
field where the bound RNA causes the beads bound to the target RNA to migrate toward the 
anode, or positive, end of the field. 

Beads can be preferentially suspended in solution as a colloidal suspension 
with the aid of detergents or surfactants. Typical detergents useful in the methods of the 
present invention include, but are not limited to, anionic detergents, such as salts of 
deoxycholic acid, 1-heptanesulfonic acid, N-laurylsarcosine, lauryl sulfate, 1 -octane 
sulfonic acid, carboxymethylcellulose, carrageenan, and taurocholic acid; cationic 
detergents such as benzalkonium chloride, cetylpyridinium, methylbenzethonium chloride, 
and decamethonium bromide; zwitterionic detergents such as CHAPS, CHAPSO, alkyl 
betaines, alky amidoalkyl betaines, 

N-dodecyl-N,N-dimethyl-3-ammonio-l-propanesulfonate, and phosphatidylcholine; and 
non-ionic detergents such as n-decyl a-D-glucopyranoside, n-decyl-D-maltopyranoside, 
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n-dodecyl -D-maltoside, n-octyl -D-glucopyranoside, sorbitan esters, n-tetradecyl 
-D-maltoside and tritons. Preferably, the detergent, if present, is a nonionic detergent. 
Typical surfactants useful in the methods of the present invention include, but are not 

5 limited to, ammonium lauryl sulfate, polyethylene glycols, butyl glucoside, decyl glucoside, 
Polysorbate 80, lauric acid, myristic acid, palmitic acid, potassium palmitate, undecanoic 
acid, lauryl betaine, and lauryl alcohol. 

Complexed RNA may or may not be irreversibly bound to the bead by a 
further transformation between the bound RNA and an additional moiety on the surface of 

j q the bead. Such linking methods include, but are not limited to: photochemical crosslinking 
between RNA and bead-bound molecules such as psoralen, thymidine or uridine derivates 
either present as monomers, oligomers, or as a partially complementary sequence; or 
chemical ligation by disulfide exchange, nitrogen mustards, bond formation between an 
electrophile and a nucleophile, or alkylating reagents. 

2 5 If the RNA is irreversibly bound to the bead, test compounds can be isolated 

from the bead following destruction of the bound RNA by enzymatic degradation including, 
but not limited to, ribonucleases A, U 2 , CL 3 , T„ Phy M, B. cereus or chemical degradation 
including, but not limited to, piperidine-promoted backbone cleavage of abasic sites 
(following treatment with sodium hydroxide, hydrazine, piperidine formate, or dimethyl 

20 sulfate), or metal-assisted (e.g. nickel(H), cobalt(EI), or iron(II)) oxidative cleavage. 

5.5.5. Microwave Spectroscopy 
In another embodiment, the complexed beads are separated from 
uncomplexed beads by microwave spectroscopy. For example, as described in U.S. Patent 

25 Nos. 6,395,480; 6,376,258; 6,368,795; 6,340,568; 6,338,968; 6,287,874; and 6,287,776 to 
Hefti, the disclosures of which are hereby incorporated by reference, the unique dielectric 
properties of molecules and binding complexes, such as hybridization complexes formed 
between a nucleic acid probe and a nucleic acid target, molecular binding events, and 
protein/ligand complexes, result in varying microwave spectra which can be measured. The 

^ 0 molecule's dielectric properties can be observed by coupling a test signal to the molecule 
and observing the resulting signal. When the test signal excites the molecule at a frequency 
within the molecule's dispersion regime, especially at a resonant frequency, the molecule 
will interact strongly with the signal, and the resulting signal will exhibit dramatic 
variations in its measured amplitude and phase, thereby generating a unique signal response. 

35 This res P onse can be used to detect and identify the bound molecular structure. In addition, 
because most molecules will exhibit different dispersion properties over the same or • 
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different frequency bands, each generates a unique signal response which can be used to 
identify the molecular structure. 

5.6. Methods for Identifying Test Compounds 
If the library is a peptide or nucleic acid library, the sequence of the test 
compound on the isolated bead can be determined by direct sequencing of the peptide or 
nucleic acid. Such methods are well known to one of skill in the art. 

5.6.1. Mass Spectrometry 

Mass spectrometry (e.g., electrospray ionization ("ESI") and matrix-assisted 
laser desorption-ionization ("MALDI"), Fourier-transform ion cyclotron resonance ("FT- 
ICR")) can be used both for high-throughput screening of test compounds that bind to a 
target RNA and elucidating the structure of the test compound on the isolated bead. 

MALDI uses a pulsed laser for desorption of the ions and a time-of-flight 
1 5 analyzer, and has been used for the detection of noncovalent tRN A:amino-acyl-tRN A 
synthetase complexes (Gruic-Sovulj etal, 1997, J. Biol. Chem. 272:32084-32091). 
However, covalent cross-linking between the target nucleic acid and the test compound is 
required for detection, since a non-covalently bound complex may dissociate during the 
MALDI process. 

~ J ESI mass spectrometry ("ESI-MS") has been of greater utility for studying 

non-covalent molecular interactions because, unlike the MALDI process, ESI-MS generates 
molecular ions with little to no fragmentation (Xavier et al, 2000, Trends Biotechnol. 
1 8(8):349-356). ESI-MS has been used to study the complexes formed by HIV Tat peptide 
and protein with the TAR RNA (Sannes-Lowery et al. , 1 997, Anal. Chem. 69:5 130-51 35). 

Fourier-transform ion cyclotron resonance ("FT-ICR") mass spectrometry 
provides high-resolution spectra, isotope-resolved precursor ion selection, and accurate 
mass assignments (Xavier et al, 2000, Trends Biotechnol. 18(8):349-356). FT-ICR has 
been used to study the interaction of aminoglycoside antibiotics with cognate and non- 
cognate RNAs (Hofstadler et al., 1999, Anal. Chem. 71 :3436-3440; Griffey et al, 1999, 

30 Proc. Natl. Acad. Sci. USA 96:10129-10133). As true for all of the mass spectrometry 
methods discussed herein, FT-ICR does not require labeling of the target RNA or a test 
compound. 

An advantage of mass spectroscopy is not only the elucidation of the 
structure of the test compound, but also the determination of the structure of the test 
35 compound bound to the preselected target RNA. Such information can enable the discovery 
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of a consensus structure of a test compound that specifically binds to a preselected target 
RNA. 

In a preferred embodiment, the structure of the test compound is determined 
by time of flight mass spectroscopy ("TOF-MS"). In time of flight methods of mass 
spectrometry, charged (ionized) molecules are produced in a vacuum and accelerated by an 
electric field into a time of flight tube or drift tube. The velocity to which the molecules 
may be accelerated is proportional to the accelerating potential, proportional to the charge of 
the molecule, and inversely proportional to the square of the mass of the molecule. The 
charged molecules travel, i.e. 5 "drift" down the TOF tube to a detector. The time taken for 
the molecules to travel down the tube may be interpreted as a measure of their molecular 
weight. Time-of-flight mass spectrometers have been developed for all of the major 
ionization techniques such as, but limited to, electron impact ("EI"), infrared laser 
desorption ("IRLD"), plasma desorption ("PD"), fast atom bombardment ("FAB"), 
secondary ion mass spectrometry ("SIMS"), matrix-assisted laser desorption/ionization 
("MALDI"), and electrospray ionization ("ESI"). 



5.6.2. NMR Spectroscopy 

NMR spectroscopy can be used for elucidating the structure of the test 
compound on the isolated bead. NMR spectroscopy is a technique for identifying binding 
sites in target nucleic acids by qualitatively determining changes in chemical shift, 
specifically from distances measured using relaxation effects. Examples of NMR that can 
be used for the invention include, but are not limited to, one-dimentional NMR, two- 
dimentional NMR, correlation spectroscopy ("COSY"), and nuclear Overhauser effect 
("NOE") spectroscopy. Such methods of structure determination of test compounds are 
well known to one of skill in the art. 

Similar to mass spectroscopy, an advantage of NMR is the not only the 
elucidation of the structure of the test compound, but also the determination of the structure 
of the test compound bound to the preselected target RNA. Such information can enable the 
discovery of a consensus structure of a test compound that specifically binds to a 
preselected target RNA. 



35 



5.6.3. Edman Degradation 

In an embodiment wherein the library is a peptide library or a derivative 
thereof, Edman degradation can be used to determine the structure of the test compound. In 
one embodiment, a modified Edman degradation process is used to obtain compositional 
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tags for proteins, which is described in U.S. Patent No. 6,277,644 to Famsworth et al , 
which is hereby incorporated by reference in its entirety. The Edman degradation chemistry 
is separated from amino acid analysis, circumventing the serial requirement of the 
conventional Edman process. Multiple cycles of coupling and cleavage are performed prior 
to extraction and compositional analysis of amino acids. The amino acid composition 
information is then used to search a database of known protein or DNA sequences to 
identify the sample protein. An apparatus for performing this method comprises a sample 
holder for holding the sample, a coupling agent supplier for supplying at least one coupling 
agent, a cleavage agent supplier for supplying a cleavage agent, a controller for directing the 
sequential supply of the coupling agents, cleavage agents, and other reagents necessary for 
performing the modified Edman degradation reactions, and an analyzer for analyzing amino 
acids. 

In another embodiment, the method can be automated as described in U.S. 
Patent No. 5,565,171 to Dovichi et al, which is hereby incorporated by reference in its 
entirety. The apparatus includes a continuous capillary connected between two valves that 
control fluid flow in the capillary. One part of the capillary forms a reaction chamber where 
the sample may be immobilized for subsequent reaction with reagents supplied through the 
valves. Another part of the capillary passes through or terminates in the detector portion of 
an analyzer such as an electrophoresis apparatus, liquid chromatographic apparatus or mass 
spectrometer. The apparatus may form a peptide or protein sequencer for carrying out the 
Edman degradation reaction and analyzing the reaction product produced by the reaction. 
The protein or peptide sequencer includes a reaction chamber for carrying out coupling and 
cleavage on a peptide or protein to produce derivatized amino acid residue, a conversion 
chamber for carrying out conversion and producing a converted amino acid residue and an 
analyzer for identifying the converted amino acid residue. The reaction chamber may be 
contained within one arm of a capillary and the conversion chamber is located in another 
arm of the capillary. An electrophoresis length of capillary is directly capillary coupled to 
the conversion chamber to allow electrophoresis separation of the converted amino acid 
residue as it leaves the conversion chamber. Identification of the converted amino acid 
residue takes place at one end of the electrophoresis length of the capillary. 

5.6.4. Vibrational Spectroscopy 

Vibrational spectroscopy (e.g. infrared (IR) spectroscopy or Raman 
spectroscopy) can be used for elucidating the structure of the test compound on the isolated 
bead. 
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Infrared spectroscopy measures the frequencies of infrared light 
(wavelengths from 100 to 10,000 nm) absorbed by the test compound as a result of 
excitation of vibrational modes according to quantum mechanical selection rules which 
require that absorption of light cause a change in the electric dipole moment of the 
molecule. The infrared spectrum of any molecule is a unique pattern of absorption 
wavelengths of varying intensity that can be considered as a molecular fingerprint to 
identify any compound. 

Infrared spectra can be measured in a scanning mode by measuring the 
absorption of individual frequencies of light, produced by a grating which separates 
frequencies from a mixed-frequency infrared light source, by the test compound relative to a 
standard intensity (double-beam instrument) or pre-measured ('blank') intensity 
(single-beam instrument). In a preferred embodiment, infrared spectra are measured in a 
pulsed mode (FT-IR) where a mixed beam, produced by an interferometer, of all infrared 
light frequencies is passed through or reflected off the test compound. The resulting 
interferogram, which may or may not be added with the resulting interferograms from 
subsequent pulses to increase the signal strength while averaging random noise in the 
electronic signal, is mathematically transformed into a spectrum using Fourier Transform or 
Fast Fourier Transform algorithms. 

Raman spectroscopy measures the difference in frequency due to absorption 
of infrared frequencies of scattered visible or ultraviolet light relative to the incident beam. 
The incident monochromatic light beam, usually a single laser frequency, is not truly 
absorbed by the test compound but interacts with the electric field transiently. Most of the 
light scattered off the sample will be unchanged (Rayleigh scattering) but a portion of the 
scatter light will have frequencies that are the sum or difference of the incident and 
molecular vibrational frequencies. The selection rules for Raman (inelastic) scattering 
require a change in polarizability of the molecule. While some vibrational transitions are 
observable in both infrared and Raman spectrometry, must are observable only with one or 
the other technique. The Raman spectrum of any molecule is a unique pattern of absorption 
wavelengths of varying intensity that can be considered as a molecular fingerprint to 
identify any compound. 

Raman spectra are measured by submitting monochromatic light to the 
sample, either passed through or preferably reflected off, filtering the Rayleigh scattered 
light, and detecting the frequency of the Raman scattered light. An improved Raman 
spectrometer is described in US Patent No. 5,786,893 to Fink et al 9 which is hereby 
incorporated by reference. 
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Vibrational microscopy can be measured in a spatially resolved fashion to 
address single beads by integration of a visible microscope and spectrometer. A 
microscopic infrared spectrometer is described in U.S. Patent No. 5,581 ,085 to Refiner et 
5 al. , which is hereby incorporated by reference in its entirety. An instrument that 

simultaneously performs a microscopic infrared and microscopic Raman analysis on a 
sample is described in U.S. Patent No. 5,841,139 to Sostek et al., which is hereby 
incorporated by reference in its entirety. 

In one embodiment of the method, test compounds are synthesized on 
10 polystyrene beads doped with chemically modified styrene monomers such that each 
resulting bead has a characteristic pattern of absorption lines in the vibrational (IR or 
Raman) spectrum, by methods including but not limited to those described by Fenniri et al, 
2000, J. Am. Chem. Soc. 123:8151-8152. Using methods of split-pool synthesis familiar to 
one of skill in the art, the library of compounds is prepared so that the spectroscopic pattern 
1 5 ofihQ bead identifies one of the components of the test compound on the bead. Beads that 
have been separated according to their ability to bind target RNA can be identified by their 
vibrational spectrum. In one embodiment of the method, appropriate sorting and binning of 
the beads during synthesis then allows identification of one or more further components of 
the test compound on any one bead. In another embodiment of the method, partial 
2Q identification of the compound on a bead is possible through use of the spectroscopic 
pattern of the bead with or without the aid of further sorting during synthesis, followed by 
partial resynthesis of the possible compounds aided by doped beads and appropriate sorting 
during synthesis. 

In another embodiment, the IR or Raman spectra of test compounds are 
25 examined while the compound is still on a bead, preferably, or after cleavage from bead, 
using methods including but not limited to photochemical, acid, or heat treatment The test 
compound can be identified by comparison of the IR or Raman spectral pattern to spectra ' 
previously acquired for each test compound in the combinatorial library. 



30 5.6.5. Microwave Spectroscopy 

In another embodiment, the microwave spectra of a test compound can be 

used to elucidate the structure of the test compound. For example, as described in U.S. 

Patent Nos. 6,395,480; 6,376,258; 6,368,795; 6,340,568; 6,338,968; 6,287,874; and 

6,287,776 to Hefti, the disclosures of which are hereby incorporated by reference, the 
35 unique dielectric properties of molecules and binding complexes, such as hybridization 

complexes formed between a nucleic acid probe and a nucleic acid target, molecular 
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binding events, and protein/ligand complexes, result in varying microwave spectra which 
can be measured. The molecule's dielectric properties can be observed by coupling a test 
signal to the molecule and observing the resulting signal. When the test signal excites the 
molecule at a frequency within the molecule's dispersion regime, especially at a resonant 
frequency, the molecule will interact strongly with the signal, and the resulting signal will 
exhibit dramatic variations in its measured amplitude and phase, thereby generating a 
unique signal response. This response can be used to detect and identify the bound 
molecular structure. In addition, because most molecules will exhibit different dispersion 
properties over the same or different frequency bands, each generates a unique signal 
response which can be used to identify the molecular structure. 
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5.7. Secondary Biological Screens 

The test compounds identified in the nonsense suppression assay (for 
convenience referred to herein as a "lead" compound) can be tested for biological activity 
using an in vitro transcribed RNA from the gene with the premature translation termination 
codon and in vitro translating that RNA in a cell-free translation extract. The activity of the 
lead compound in the in vitro translation mixture can be determined by any method that 
measures increased expression of the full-length gene, i.e., past the premature termination 
codon. For example, expression of a functional protein from the full-length gene (e.g. , a 
reporter gene) can be measured to determine the effect of the lead compound on premature 
translation termination and/or nonsense-mediated mRNA decay in an in vitro system. 

In addition, the lead compound can be tested in a host cell engineered to 
contain the RNA with the premature translation termination codon controlling the 
expression of a reporter gene. In this example, the lead compounds are assayed in the 
presence or absence of the RNA with the premature translation termination codon. 
Compounds that modulate premature translation termination and/or nonsense-mediated 
mRNA decay in vivo will result in increased expression of the full-length gene, Le. 9 past the 
premature termination codon. Alternatively, a phenotypic or physiological readout can be 
used to assess activity of the target RNA with the premature translation termination codon 
in the presence and absence of the lead compound. Both the in vitro and in vivo nonsense 
suppression assays used herein and as described in International Patent Publication WO 
01/4451 6, which is incorporated by reference in its entirety, can be used to identify lead 
compounds can also be used to determine an EC 50 for the lead compounds. 

The test compounds identified in the nonsense suppression assay (for . 
convenience referred to herein as a "lead" compound) can be tested for biological activity 
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using host cells containing or engineered to contain the target RNA element coupled to a 
functional readout system. For example, the lead compound can be tested in a host cell 
engineered to contain the RNA with the premature translation termination codon controlling 
the expression of a reporter gene. In (his example, the lead compounds are assayed in the 
presence or absence of the RNA with the premature translation termination codon. 
Compounds that modulate premature translation termination and/or nonsense-mediated 
mRNA decay in vivo will result in increased expression of the full-length gene, i.e., past the 
premature termination codon. Alternatively, a phenotypic or physiological readout can be 
used to assess activity of the target RNA with the premature translation termination codon 
in the presence and absence of the lead compound. Both the in vitro and in vivo nonsense 
suppression assays used herein and as described in International Patent Publication WO 
01/44516, which is incorporated by reference in its entirety, can be used to identify lead 
compounds can also be used to determine an EC j0 for the lead compounds. 

Animal model systems can also be used to demonstrate the safety and 
efficacy of the lead compounds identified in the nonsense suppression assays described 
above. The lead compounds identified in the nonsense suppression assay can then be tested 
for biological activity using animal models for a disease, condition, or syndrome of interest. 
These include animals engineered to contain the target RNA element coupled to a functional 
readout system, such as a transgenic mouse. 

Examples of animal models for cystic fibrosis include, but are not limited to, 
cftr(-/-) mice (see, e.g., Freedman et ah, 2001, Gastroenterology 121(4):950-7), 
cftr(tmlHGU/tmlHGU) mice (see, e.g., Bernhard et ah, 2001, Exp Lung Res 27(4):349- 
66), CFTR-deficient mice with defective cAMP-mediated Cl(-) conductance (see, e.g., 
Stotland et ah, 2000, Pediatr Pulmonol 30(5):413-24), and C57BL/6- 
Cftr(mlUNC)/Cftr(mlUNC) knockout mice (see, e.g., Stotland et ah, 2000, Pediatr 
Pulmonol 30(5):41 3-24). 

Examples of animal models for muscular dystrophy include, but are not 
limited to, mouse, hamster, cat, dog, and C. elegans. Examples of mouse models for 
muscular dystrophy include, but are not limited to, the dy-/- mouse (see, e.g., Connolly et 
ah, 2002, J Neuroimmunol 127(1 -2): 80-7), a muscular dystrophy with myositis (mdm) 
mouse mutation (see, e.g., Garvey et ah, 2002, Genomics 79(2): 146-9), the mdx mouse 
(see, e.g., Nakamura et ah, 2001, Neuromuscul Disord 1 1(3):251-9), the utrophin- 
dystrophin knockout (dko) mouse (see, e.g., Nakamura et ah, 2001 , Neuromuscul Disord 
1 1(3):251-9), the dy/dy mouse (see, e.g., Dubowitz et ah, 2000, Neuromuscul Disord 10(4- 
5):292-8), the mdx(Cv3) mouse model (see, e.g., Pillers et ah, 1999, Laryngoscope 
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109(8):13 10-2), and the myotonic ADR-MDX mutant mice (see, e.g., Kramer et al, 1998, 
Neuromuscul Disord 8(8):542-50). Examples of hamster models for muscular dystrophy ' 
include, but are not hmited to, sarcoglycan-deficient hamsters (see, e.g., Nakamura et al, 
5 2001, Am J Physiol Cell Physiol 281(2):C690-9) and the BIO 14.6 dystrophic hamster (see, 
^.,Schlenker&Burbach,1991,JApplPhysiol71(5):1655-62). An example of a feline 
model for muscular dystrophy includes, but is not limited to, the hypertrophic feline 
muscular dystrophy model (see, e.g., Gaschen & Burgunder, 2001, Acta Neuropathol (Berl) 
101(6):591-600). Canine models for muscular dystrophy include, but are not limited to, 
1 0 8 ° lden retriever muscular dystrophy (see, e.g. , Fletcher et al, 200 1, Neuromuscul Disord 
1 1(3):239-43) and canine X-linked muscular dystrophy (see, e.g., Valentine et al, 1992, 
Am JMedGenet42(3):352-6). Examples of C. elegans models for muscular dystrophy are 
described in Chamberlain & Benian, 2000, Curr Biol 10(21):R795-7 and Culette & Sattelle, 
2000, Hum Mol Genet 9(6):869-77. 
1 5 Examples of animal models for familial hypercholesterolemia include, but 

are not limited to, mice lacking functional LDL receptor genes (see, e.g., Aji et al., 1997, 
Circulation 95(2):430-7), Yoshida rats (see, e.g., Fantappie et al., 1992, Life Sci 
50(24):1913-24), the JCR:LA-cp rat (see, e.g., Richardson et al, 1998, Atherosclerosis 
138(l):135-46), swine (see, e.g., Hasler-Rapacz et al, 1998, Am J Med Genet 76(5):379- 
2Q 86), and the Watanabe heritable hyperlipidaemic rabbit (see, e.g., Tsutsumi et al., 2000, 
Arzneimittelforschung 50(2):1 18-21 ; Harsch et al., 1998, Br J Pharmacol 124(2):227-82; 
and Tanaka et al., 1995, Atherosclerosis 114(l):73-82). 

An example of an animal model for human cancer in general includes, but is 
not limited to, spontaneously occurring tumors of companion animals (see, e.g., Vail & 
25 MacEwen, 2000, Cancer Invest 18(8):781-92). Examples of animal models for lung cancer 
include, but are not limited to, lung cancer animal models described by Zhang & Roth 
(1994, In Vivo 8(5):755-69) and a transgenic mouse model with disrupted p53 function 
(see, e.g., Morris et al, 1998, J La State Med Soc 150(4): 179-85). An example of an 
animal model for breast cancer includes, but is not limited to, a transgenic mouse that 
3Q overexpresses cyclin Dl (see, e.g., Hosokawa etal, 2001, Transgenic Res 10(5):471-8). 
An example of an animal model for colon cancer includes, but is not limited to, a TCRbeta 
and p53 double knockout mouse (see, e.g., Kado et al, 2001, Cancer Res 61(6):2395-8). 
Examples of animal models for pancreatic cancer include, but are not limited to, a 
metastatic model of Panc02 murine pancreatic adenocarcinoma (see, e.g., Wang et al, 2001, 
35 Int J Pancreatol 29(l):37-46) and nu-nu mice generated in subcutaneous pancreatic tumours 
(see, e.g., Ghaneh et al, 2001, Gene Ther 8(3): 199-208). Examples of animal models for 
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non-Hodgkin's lymphoma include, but are not limited to, a severe combined 
immunodeficiency ("SCID") mouse (see, e.g., Bryant et al, 2000, Lab Invest 80(4):553-73) 
and an IgHmu-HOXl 1 transgenic mouse (see, e.g., Hough et al, 1998, Proc Natl Acad Sci 
USA 95(23): 13853-8). An example of an animal model for esophageal cancer includes, but 
is not limited to, a mouse transgenic for the human papillomavirus type 16 E7 oncogene 
(see, e.g., Herber et al, 1996, J Virol 70(3):1873-81). Examples of animal models for 
colorectal carcinomas include, but are not limited to, Ape mouse models (see, e.g., Fodde & 
Smits, 2001, Trends Mol Med 7(8):369-73 and Kuraguchi et al., 2000, Oncogene 
19(50):5755-63). An example of an animal model for neurofibromatosis includes, but is not 
limited to, mutant NF1 mice (see, e.g., Cichowski et al., 1996, Semin Cancer Biol 7(5):291- 
8). Examples of animal models for retinoblastoma include, but are not limited to, 
transgenic mice that expression the simian virus 40 T antigen in the retina (see, e.g., Howes 
et al., 1994, Invest Ophthalmol Vis Sci 35(2):342-51 and Windle et al, 1990, Nature 
343(6259):665-9) and inbred rats (see, e.g., Nishida et al., 1981, Curr Eye Res l(l):53-5 
and Kobayashi etal, 1982, ActaNeuropathol (Berl) 57(2-3):203-8). Examples of animal 
models for Wilm's tumor include, but are not limited to, a WT1 knockout mice (see, e.g., 
Scharnhorst et al., 1997, Cell Growth Differ 8(2):133-43), a rat subline with a high 
incidence of nephroblastoma (see, e.g., Mesfin & Breech, 1996, Lab Anim Sci 46(3):321- 
6), and a Wistar/Furth rat with Wilms' tumor (see, e.g., Murphy et al, 1987, Anticancer Res 
7(4B):717-9). 

Examples of animal models for retinitis pigmentosa include, but are not 
limited to, the Royal College of Surgeons ("RCS") rat (see, e.g., Vollrath et al., 2001, Proc 
Natl Acad Sci USA 98(22); 12584-9 and Hanitzsch et al., 1998, Acta Anat (Basel) 162(2- 
3): 11 9-26), a rhodopsin knockout mouse (see, e.g., Jaissle et al., 2001, Invest Ophthalmol 
Vis Sci 42(2):506-13), and Wag/Rij rats (see, e.g., Lai etal, 1980, Am J Pathol 98(1):281- 

Examples of animal models for cirrhosis include, but are not limited to, 
CCl 4 -exposed rats (see, e.g., Kloehn et al, 2001, Horm Metab Res 33(7):394-401) and 
rodent models instigated by bacterial cell components or colitis (see, e.g., Vierling, 2001, 
Best Pract Res Clin Gastroenterol 15(4):591-610). 

Examples of animal models for hemophilia include, but are not limited to, 
rodent models for hemophilia A (see, e.g., Reipert et al, 2000, Thromb Haemost 84(5):826- 
32; Jarvis et al.,. 1996, Thromb Haemost 75(2):318-25; and Bi et al, 1995, Nat Genet 
10(1):119-21), canine models for hemophilia A (see, e.g., Gallo-Penn etal, 1999, Hum 
Gene Ther 10(1 1):1791-802 and Connelly et al, 1998, Blood 91(9);3273-81), murine 
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models for hemophilia B (see, e.g., Snyder ef al, 1999, Nat Med 5(l):64-70; Wang et al, 
1997, Proc Natl Acad Sci USA 94(21): 1 1563-6; and Fang et al, 1996, Gene Ther 3(3):217- 
22), canine models for hemophilia B (see, e.g., Mount et al, 2002, Blood 99(8):2670-6; 
Snyder et al, 1999, Nat Med 5(l):64-70; Fang et al, 1996, Gene Ther 3(3):217-22); and 
Kay et al., 1 994, Proc Natl Acad Sci USA 91(6):2353-7), and a rhesus macaque model for 
hemophilia B (see, e.g., Lozier et al, 1999, Blood 93(6): 1875-81). 

Examples of animal models for von Willebrand disease include, but are not 
limited to, an inbred mouse strain RJUS/J (see, e.g., Nichols et al, 1994, 83(1 1):3225-31 
and Sweeney et al, 1990, 76(1 1):2258-65), rats injected with botrocetin (see, e.g., Sanders 
et al, 1988, Lab Invest 59(4):443-52), and porcine models for von Willebrand disease (see, 
e.g., Nichols et al, 1995, Proc Nad Acad Sci USA 92(7):2455-9; Johnson & Bowie, 1992, J 
Lab Clin Med 120(4):553-8); and Brinkhous et al, 1991, Mayo Clin Proc 66(7):733-42). 

Examples of animal models for p-thalassemia include, but are not limited to, 
murine models with mutations in globin genes (see, e.g., Lewis et al, 1998, Blood 
91(6):2152-6; Raja etal, 1994, Br J Haematol 86(l):156-62; Pdpp etal, 1985, 445:432-44; 
and Skow et al, 1983, Cell 34(3): 1043-52). 

Examples of animal models for kidney stones include, but are not limited to, 
genetic hypercalciuric rats (see, e.g., Bushinsky et al, 1999, Kidney Int 55(l):234-43 and 
Bushinsky etal, 1995, Kidney Int 48(6): 1705-13), chemically treated rats (see, e.g., Grases 
et al, 1998, Scand J Urol Nephrol 32(4):261-5; Burgess et al, 1995, Urol Res 23(4):239- 
42; Kumar et al, 1991, J Urol 146(5): 1384-9; Okada et al, 1985, Hinyokika Kiyo 
31(4):565-77; and Bluestone et al, 1975, Lab Invest 33(3):273-9), hyperoxaluric rats (see, 
e.g., Jones etal, 1991, J Urol 145 (4): 868-74), pigs with unilateral retrograde flexible 
25 nephroscopy (see, e.g. , Seifmah et al, 2001 , 57(4):832-6), and rabbits with an obstructed 
upper urinary tract (see, e.g., Itatani et al, 1979, Invest Urol 17(3):234-40). 

Examples of animal models for ataxia-telangiectasia include, but are not 
limited to, murine models of ataxia-telangiectasia (see, e.g., Barlow et al, 1999, Proc Nad 
Acad Sci USA 96(17):9915-9 and Inoue et al, 1986, Cancer Res 46(8):3979-82). 
30 Compounds displaying the desired biological activity can be considered to be 

lead compounds, and will be used in the design of congeners or analogs possessing useful 
pharmacological activity and physiological profiles. Following the identification of a lead 
compound, molecular modeling techniques can be employed, which have proven to be 
useful in conjunction with synthetic efforts, to design variants of the lead that can be more 
35 effective. These applications may include, but are not limited to, Pharmacophore Modeling 
(c/ Lamothe, etal. 1997, J. Med. Chem. 40: 3542; Mottola et al. 1996, J. Med. Chem. 39: 
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285; Beusen et al. 1995, Biopolymers 36: 181 ; P. Fossa et al. 1998, Comput Aided Mol. 
Des. 12: 361), QSAR development (cf. Siddiqui et al. 1999, J. Med. Chem. 42: 4122; 
Barreca et al. 1999 Bioorg. Med. Chem. 7: 2283; Kroemer et al. 1995, J. Med. Chem. 38: 
4917; Schaal et al. 2001, J. Med. Chem. 44: 155; Buolamwini & Assefa 2002, J. Mol. 
Chem. 45: 84), Virtual docking and screening/scoring (cf. Anzini et al. 2001, J. Med. Chem. 
44: 1 134; Faaland et al. 2000, Biochem. Cell. Biol. 78: 415; Silvestri et al. 2000, Bioorg. 
Med. Chem. 8: 2305; J. Lee et al. 2001, Bioorg. Med. Chem. 9: 19), and Structure 
Prediction using RNA structural programs including, but not limited to mFold (as described 
by Zuker et al. Algorithms and Thermodynamics for RNA Secondary Structure Prediction: 
A Practical Guide in RNA Biochemistry and Biotechnology pp. 1 1-43, J. Barciszewski & 
B.F.C. Clark, eds. (NATO ASI Series, Kluwer Academic Publishers, 1999) and Mathews et 
al 1999 J. Mol. Biol. 288: 91 1-940); RNAmotif (Macke et al. 2001, Nucleic Acids Res. 29: 
4724-4735; and the Vienna RNA package (Hofacker et al. 1994, Monatsh. Chem. 125: 
167-188). 

Further examples of the application of such techniques can be found in 
several review articles, such as Rotivinen et al., 1988, Acta Pharmaceutical Fennica 
97:159-166; Ripka, 1998, New Scientist 54-57; McKinaly & Rossmann, 1989, Annu. Rev. 
Pharmacol. Toxiciol. 29:111-122; Perry & Davies, QSAR: Quantitative Structure-Activity 
Relationships in Drug Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis & Dean, 1989, 
Proc. R. Soc. Lond. 236:125-140 and 141-162; Askew etal., 1989, J. Am. Chem. Soc. 
Ill: 1082-1 090. Molecular modeling tools employed may include those from Tripos, Inc., 
St. Louis, Missouri (e.g., Sybyl/UNITY, CONCORD, DiverseSolutions), Accelerys, San 
Diego, California (e.g., Catalyst, Wisconsin Package {BLAST, etc.}), Schrodinger, 
Portland, Oregon (e.g., QikProp, QikFit, Jaguar) or other such vendors as BioDesign, Inc. 
(Pasadena, California), Allelix, Inc. (Mississauga, Ontario, Canada), and Hypercube, Inc. 
(Cambridge, Ontario, Canada), and may include privately designed and/or "academic" 
software (e.g. RNAMotif, mFOLD). These application suites and programs include tools 
for the atomistic construction and analysis of structural models for drug-like molecules, 
proteins, and DNA or RNA and their potential interactions. They also provide for the 
calculation of important physical properties, such as solubility estimates, permeability 
metrics, and empirical measures of molecular "druggability" (e.g., Lipinski "Rule of 5" as 
described by Lipinski et al. 1997, Adv. Drug Delivery Rev. 23: 3-25). Most importantly, 
they provide appropriate metrics and statistical modeling power (such as the patented 
CoMFA technology in Sybyl as described in US Patents 6,240,374 and 6, 1 85,506) to 
develop Quantitative Structural Activity Relationships (QSARs) which are used to guide the 
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synthesis of more efficacious clinical development candidates while improving desirable 
physical properties, as determined by results from the aforementioned secondary screening 
protocols. 

5 

5.8. Use of Identified Compounds That Bind RNA to Treat/Prevent Disease 

Biologically active compounds identified using the methods of the invention 
or a pharmaceutically acceptable salt thereof can be administered to a patient, preferably a 
mammal, more preferably a human, suffering from a disease whose progression is 

I q associated with premature translation termination and/or nonsense-mediated mRNA decay. 
In certain embodiments, such compounds or a pharmaceutically acceptable salt thereof is 
administered to a patient, preferably a mammal, more preferably a human, as a preventative 
measure against a disease associated with premature translation termination and/or 
nonsense-mediated mRNA decay. 

j 5 In a preferred embodiment, it is first determined that the patient is suffering 

from a disease associated with premature translation termination and/or nonsense-mediated 
mRNA decay. In a preferred embodiment, the DNA of the patient can be sequenced or 
subject to Southern blot, polymerase chain reaction ("PCR"), use of the Short Tandem 
Repeat ("STR"), or polymorphic length restriction fragments ("RFLP") analysis to 

20 determine if a nonsense mutation is present in the DNA of the patient. Alternatively, it can 
be determined if altered levels of the protein with the nonsense mutation are expressed in 
the patient by western blot or other immunoassays. Such methods are well known to one of 
skill in the art. 

In one embodiment, "treatment" or "treating" refers to an amelioration of a 
25 disease, or at least one discernible symptom thereof. In another embodiment, "treatment" or 
"treating" refers to an amelioration of at least one measurable physical parameter, not 
necessarily discernible by the patient. In yet another embodiment, "treatment" or "treating" 
refers to inhibiting the progression of a disease, either physically, e.g. , stabilization of a 
discernible symptom, physiologically, e.g., stabilization of a physical parameter, or both. In 
3Q yet another embodiment, "treatment" or "treating" refers to delaying the onset of a disease. 

In certain embodiments, the compound or a pharmaceutically acceptable salt 
thereof is administered to a patient, preferably a mammal, more preferably a human, as a 
preventative measure against a disease associated with a disease caused and/or associated 
with nonsense suppression. As used herein, "prevention" or "preventing" refers to a 
^ reduction of the risk of acquiring a disease. In one embodiment, the compound or a . 

pharmaceutically acceptable salt thereof is administered as a preventative measure to a 

i." 
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patient. According to this embodiment, the patient can have a genetic predisposition to a 
disease, such as a family history of the disease, or a non-genetic predisposition to the 
disease. Accordingly, the compound and pharmaceutically acceptable salts thereof can be 
used for the treatment of one manifestation of a disease and prevention of another. 
5 In a preferred embodiment, the compounds identified using the methods of 

the present invention are used to treat or prevent a disease caused by one or more nonsense 
mutations. Examples of diseases caused by nonsense mutations include, but are not limited 
to, cystic fibrosis, muscular dystrophy, heart disease, lung cancer, breast cancer, colon 
cancer, pancreatic cancer, non-Hodgkin's lymphoma, ovarian cancer, esophageal cancer, 
10 colorectal carcinomas, neurofibromatosis, retinoblastoma, Wilm's tumor, retinitis 
pigmentosa, collagen disorders, cirrhosis, Tay-Sachs disease, blood disorders, kidney 
stones, and ataxia-telangiectasia. Genes that contain one or more nonsense mutations that 
are potentially involved in causing disease are presented in table form according to 
chromosome location in Example 9 infra. 
1 5 when administered to a patient, the compound or a pharmaceutically 

acceptable salt thereof is preferably administered as component of a composition that 
optionally comprises a pharmaceutically acceptable vehicle. The composition can be 
administered orally, or by any other convenient route, for example, by infusion or bolus 
injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, 
20 rectal, and intestinal mucosa, etc.) and may be administered together with another 
biologically active agent. Administration can be systemic or local. Various delivery 
systems are known, e.g. , encapsulation in liposomes, microparticles, microcapsules, 
capsules, etc., and can be used to administer the compound and pharmaceutically acceptable 
salts thereof. 

^ Methods of administration include but are not limited to intradermal, 

intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, oral, 
sublingual, intranasal, intracerebral, intravaginal, transdermal, rectally, by inhalation, or 
topically, particularly to the ears, nose, eyes, or skin. The mode of administration is left to 
the discretion of the practitioner. In most instances, administration will result in the release 
30 of the compound or a pharmaceutically acceptable salt thereof into the bloodstream. 

In specific embodiments, it may be desirable to administer the compound or 
a pharmaceutically acceptable salt thereof locally. This may be achieved, for example, and 
not by way of limitation, by local infusion during surgery, topical application, e.g., in 
conjunction with a wound dressing after surgery, by injection, by means of a catheter, by 
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means of a suppository, or by means of an implant, said implant being of a porous, non- 
porous, or gelatinous material, including membranes, such as sialastic membranes, or fibers. 

In certain embodiments, it may be desirable to introduce the compound or a 
pharmaceutically acceptable salt thereof into the central nervous system by any suitable 
5 route, including intraventricular, intrathecal and epidural injection. Intraventricular 
injection may be facilitated by an intraventricular catheter, for example, attached to a 
reservoir, such as an Ommaya reservoir. 

Pulmonary administration can also be employed, e.g., by use of an inhaler or 
nebulizer, and formulation with an aerosolizing agent, or via perfusion in a fluorocarbon or 
synthetic pulmonary surfactant. In certain embodiments, the compound and 
pharmaceutically acceptable salts thereof can be formulated as a suppository, with 
traditional binders and vehicles such as triglycerides. 

In another embodiment, the compound and pharmaceutically acceptable salts 
thereof can be delivered in a vesicle, in particular a liposome (see Langer, 1990, Science 
15 249:1527-1533; Treat et al, in Liposomes in the Therapy of Infectious Disease and Cancer, 
Lopez-Berestein and Fidler (eds.), Liss, New York, pp. 353-365 (1989); Lopez-Berestein, 
ibid., pp. 317-327; see generally ibid). 

In yet another embodiment, the compound and pharmaceutically acceptable 
salts thereof can be delivered in a controlled release system (see, e.g., Goodson, in Medical 
20 Applications of Controlled Release, supra, vol. 2, pp. 1 15-138 (1984)). Other controlled- 
release systems discussed in the review by Langer, 1990, Science 249:1527-1533) may be 
used. In one embodiment, a pump may be used (see Langer, supra; Sefton, 1987, CRC Crit. 
Ref. Biomed. Eng. 14:201; Buchwald et al., 1980, Surgery 88:507 Saudek et al., 1989, N. 
Engl. J. Med. 32 1 :574). In another embodiment, polymeric materials can be used (see 
Medical Applications of Controlled Release, Langer and Wise (eds.), CRC Pres., Boca 
Raton, Florida (1974); Controlled Drug Bioavailability, Drug Product Design and 
Performance, Smolen and Ball (eds.), Wiley, New York (1984); Ranger and Peppas, 1983, 
J. Macromol. Sci. Rev. Macromol. Chem. 23:61; see also Levy et al, 1985, Science 
228:190; During et al, 1989, Ann. Neurol. 25:351; Howard etal, 1989, J. Neurosurg. 
30 71 :105). In yet another embodiment, a controlled-release system can be placed in proximity 
of a target RNA of the compound or a pharmaceutically acceptable salt thereof, thus 
requiring only a fraction of the systemic dose. 

Compositions comprising the compound or a pharmaceutically acceptable 
salt thereof ("compound compositions") can additionally comprise a suitable amount of a 
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pharmaceutically acceptable vehicle so as to provide the form for proper administration to 
the patient 

In a specific embodiment, the term "pharmaceutically acceptable" means 
approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopeia or other generally recognized pharmacopeia for use in animals, mammals, 
and more particularly in humans. The term "vehicle" refers to a diluent, adjuvant, excipient, 
or carrier with which a compound of the invention is administered. Such pharmaceutical 
vehicles can be liquids, such as water and oils, including those of petroleum, animal, 
vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the 
like. The pharmaceutical vehicles can be saline, gum acacia, gelatin, starch paste, talc, 
keratin, colloidal silica, urea, and the like. In addition, auxiliary, stabilizing, thickening, 
lubricating and coloring agents may be used. When administered to a patient, the 
pharmaceutically acceptable vehicles are preferably sterile. Water is a preferred vehicle 
when the compound of the invention is administered intravenously. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles, 
particularly for injectable solutions. Suitable pharmaceutical vehicles also include 
excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propylene, glycol, water, ethanol and the like. Compound compositions, if 
20 desired, can also contain minor amounts of wetting or emulsifying agents, or pH buffering 
agents. 

Compound compositions can take the form of solutions, suspensions, 
emulsion, tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained- 
release formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other 
25 form suitable for use. In one embodiment, the pharmaceutically acceptable vehicle is a 
capsule (see e.g., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical 
vehicles are described in Remington's Pharmaceutical Sciences, Alfonso R. Gennaro, ed., 
Mack Publishing Co. Easton, PA, 19th ed., 1995, pp. 1447 to 1676, incorporated herein by 
reference. 

In a preferred embodiment, the compound or a pharmaceutically acceptable 
salt thereof is formulated in accordance with routine procedures as a pharmaceutical 
composition adapted for oral administration to human beings. Compositions for oral 
delivery may be in the form of tablets, lozenges, aqueous or oily suspensions, granules, 
powders, emulsions, capsules, syrups, or elixirs, for example. Orally administered 
35 compositions may contain one or more agents, for example, sweetening agents such as 
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fructose, aspartame or saccharin; flavoring agents such as peppermint, oil of wintergreen, or 
cherry; coloring agents; and preserving agents, to provide a pharmaceuticaUy palatable 
preparation. Moreover, where in tablet or pill form, the compositions can be coated to delay 
disintegration and absorption in the gastrointestinal tract thereby providing a sustained 
action over an extended period of time. Selectively permeable membranes surrounding an 
osmotically active driving compound are also suitable for orally administered compositions. 
In these later platforms, fluid from the environment surrounding the capsule is imbibed by 
the driving compound, which swells to displace the agent or agent composition through an 
aperture. These delivery platforms can provide an essentially zero order delivery profile as 
opposed to the spiked profiles of immediate release formulations. A time delay material 
such as glycerol monostearate or glycerol stearate may also be used. Oral compositions can 
include standard vehicles such as mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, and the like. Such vehicles are preferably of 
pharmaceutical grade. Typically, compositions for intravenous administration comprise 
1 5 sterile isotonic aqueous buffer. Where necessary, the compositions may also include a 
solubilizing agent. 

In another embodiment, the compound or a pharmaceutically acceptable salt 
thereof can be formulated for intravenous administration. Compositions for intravenous 
administration may optionally include a local anesthetic such as lignocaine to lessen pain at 
20 the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water-free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the compound or a pharmaceutically acceptable salt thereof 
is to be administered by infusion, it can be dispensed, for example, with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the compound or a 
pharmaceutically acceptable salt thereof is administered by injection, an ampoule of sterile 
water for injection or saline can be provided so that the ingredients may be mixed prior to 
administration. 

The amount of a compound or a pharmaceutically acceptable salt thereof that 
30 will be effective in the treatment of a particular disease will depend on the nature of the 
disease, and can be determined by standard clinical techniques. In addition, in vitro or in 
vivo assays may optionally be employed to help identify optimal dosage ranges. The precise 
dose to be employed will also depend on the route of administration, and the seriousness of 
the disease, and should be decided according to the judgment of the practitioner and each 
patient's circumstances. However, suitable dosage ranges for oral administration are 
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generally about 0.001 milligram to about 200 milUgrams of a compound or a 
pharmaceutical^ acceptable salt thereof per kilogram body weight per day. In specific 
preferred embodiments of the invention, the oral dose is about 0.01 milligram to about 100 
milligrams per kilogram body weight per day, more preferably about 0.1 milligram to about 
75 milligrams per kilogram body weight per day, more preferably about 0.5 milligram to 5 
milligrams per kilogram body weight per day. The dosage amounts described herein refer 
to total amounts administered; that is, if more than one compound is administered, or if a 
compound is administered with a therapeutic agent, then the preferred dosages correspond 
to the total amount administered. Oral compositions preferably contain about 10% to about 

95% active ingredient by weight. 

Suitable dosage ranges for intravenous (i.v.) administration are about 0.01 
milligram to about 100 milligrams per kilogram body weight per day, about 0.1 milligram to 
about 35 milligrams per kilogram body weight per day, and about 1 milligram to about 10 
milligrams per kilogram body weight per day. Suitable dosage ranges for intranasal 
15 administration are generally about 0.01 pg/kg body weight per day to about 1 mg/kg body 
weight per day. Suppositories generally contain about 0.01 milligram to about 50 
milligrams of a compound of the invention per kilogram body weight per day and comprise 
active ingredient in the range of about 0.5% to about 10% by weight. 

Recommended dosages for intradermal, intramuscular, intraperitoneal, 
20 subcutaneous, epidural, sublingual, intracerebral, intravaginal, transdermal administration or 
administration by inhalation are in the range of about 0.001 milligram to about 200 
milligrams per kilogram of body weight per day. Suitable doses for topical administration 
are in the range of about 0.001 milligram to about 1 milligram, depending on the area of 
administration. Effective doses may be extrapolated from dose-response curves derived 
25 from in vitro or animal model test systems. Such animal models and systems are well 
known in the art. 

The compound and pharmaceutically acceptable salts thereof are preferably 
assayed in vitro and in vivo, for the desired therapeutic or prophylactic activity, prior to use 
in humans. For example, in vitro assays can be used to determine whether it is preferable to 
30 administer the compound, a pharmaceutically acceptable salt thereof, and/or another 

therapeutic agent. Animal model systems can be used to demonstrate safety and efficacy. 

A variety of compounds can be used for treating or preventing diseases in 
mammals. Types of compounds include, but are not limited to, peptides, peptide analogs 
including peptides comprising non-natural amino acids, e.g., D-amino acids, phosphorous 
35 analogs of amino acids, such as a-amino phosphonic acids and a-amino phosphinic acids, 
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or amino acids having non-peptide linkages, nucleic acids, nucleic acid analogs such as 
phosphorothioates or peptide nucleic acids ("PNAs")> hormones, antigens, synthetic or 
naturally occurring drugs, opiates, dopamine, serotonin, catecholamines, thrombin, 
acetylcholine, prostaglandins, organic molecules, pheromones, adenosine, sucrose, glucose, 
lactose and galactose. 

6. EXAMPLE: COMPOUNDS THAT MODULATE TRANSLATION 

TERMINATION BIND SPECIFIC REGIONS OF 28S rRNA 

I q Data is presented in this Example that demonstrates that specific regions of 

the 28S rRNA are involved in modulating translation termination in mammalian cells. 
Compounds that interact in these regions or modulate local changes within these regions of 
the ribosome (e.g., alter base pairing interactions, base modification or modulate binding of 
trans-acting factors that bind to these regions) have the potential to modulate translation 

15 termination. These regions are conserved from prokaryotes to eukaryotes, but the role of 
these regions in modulating translation termination has not been realized in eukaryotes. In 
bacteria, when a short RNA fragment, complementary to the E. coli 23 S rRNA segment 
comprising nucleotides 735 to 766 (in domain II), is expressed in vivo, suppression of UGA 
nonsense mutations, but not UAA of UAG, results (Chernyaeva et al 9 1999, J Bacteriol 

20 181 -.5257-5262). Other regions of the 23S rRNA in E. coli have been implicated in 

nonsense suppression including the GTPase center in domain II (nt 1034-1 120; Jemiolo et 
al, 1995, Proc. Nat Acad. Sci. 92:12309-12313). 

6.1. Materials and Methods 

25 

6.1,1. Small Molecules Involved in Modulating Translation Termination 
Small molecules involved in modulating translation termination, Le. 9 
nonsense suppression, were used in the footprinting experiments presented in Figures 2 to 6 
and are listed as Compound A (molecular formula C 19 H 21 N0 4 ), Compound B (molecular 
30 formula C 19 H 2l N 2 0 5 ), Compound C (molecular formula C 12 H l5 N 5 0), Compound D 

(molecular formula C 23 H l5 0 3 Br), Compound E (molecular formula C 19 H 2 ,N0 4 ), Compound 
F, Compound G (molecular formula C l2 H I5 N 5 0), Compound H (molecular formula 
C^H^NOs), Compound I (molecular formula C2 3 H I5 N0 5 ), Compound J, and Compound K. 
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6.1.2. Preparation of » Translatio n Infract from HeLa cells 

HeLa S3 cells were grown to a density of 10 6 cells/ml in DMEM; 5%C0 2 , 
10%FBS, IX P/S in a spinner flask. Cells were harvested by spinning at lOOOXg. Cells 
were washed twice with phosphate buffered saline. The cell pellet was on ice for 12-24 
hours before proceeding. By letting the cells sit on ice, the activity of the extract is 
increased two-fold. The length of time on ice can range from 0 hours to 1 week. The cells 
were resuspended in 1.5 volumes (packed cell volume) of hypotonic buffer (10 mM HEPES 
(KOH) pH 7.4; 15 mM KC1; 1.5 mM Mg(OAc)2; 0.5 mM Pefabloc (Roche); 2 mM DTT). 
The cells were allowed to swell for 5 minutes on ice, dounce homogenized with 10 to 100 
strokes using a tight-fitting pestle, and spun for 10 minutes at 12000Xg at 4»C in a Sorvall 
SS-34 rotor. The supernatant was collected with a Pasteur pipet without disturbing the lipid 
layer, transferred into Eppendorf tubes (50 to 200 ul aliquots), and immediately frozen in 
liquid nitrogen. 



15 



6.1.3. Foot printing 

The HeLa translation extract was incubated with the small molecules (at a 
concentration of 100 uM), followed by treatment with chemical modifying agents (dimethyl 
sulfate [DMS] and kethoxal [KE]). Following chemical modification, rRNA was 

n phenol-chloroform extracted, ethanol precipitated, analyzed in primer extension reactions 
using end-labeled oligonucleotides hybridizing to different regions of the rRNAs and 
resolved on 6% polyacrylamide gels. The probes used for primer extension cover the entire 
18S (7 oligonucleotide primers), 28S (24 oligonucleotide primers), and 5S (one primer) 
rRNAs are presented in Table 1 (also see, e.g., Gonzalez et al., 1985 Proc 
Natl Acad Sci USA. 82(22) :7666-70 and McCallum & Maden, 1985, Biochem. J. 232 (3): 

25 725-733). Controls in these experiments include DMSO (a control for changes in rRNA 
accessibility induced by DMSO), paromomycin (a marker for 18S rRNA binding), and 



30 



35 



anisomycin (a marker for 28S rRNA binding). 


Table 1: 


18S, 28S, and 5S rRNA primers 


5S#1 


AAAGCCTACAGCACCC 


28S#1 


TACTGAGGGAATCCTGG 


28S#2 


TTACCACCCGCTTTGGG 


28S#3 


GGGGGCGGGAAAGATCC 



SEQ ID NO.: 1 
SEQ ID NO.: 2 
SEQ ID NO.: 3 
SEQ ID NO.: 4 
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28S#4 
28S#5 
28S#6 
28S#7 
28S#8 
28S#9 
28S#10 
28S#1 1 
28S#12 
28S#13 
28S#14 
I 5 28S#15 
28S#16 
28S#17 
28S#18 
20 28S#19 
28S#20 
28S#21 
28S#22 
25 28S#23 
28S#24 
18S#1 
18S#2 
18S#3 
18S#4 
18S#5 
18S#6 
18S#7 



30 



35 



CCCCGAGCCACCTTCCC 

GGCCCCGGGATTCGGCG 

CACTGGGGACAGTCCGC 

CGCGGCGGGCGAGACGGG 

GAGGGAAACTTCGGAGGG 

CATCGGGCGCCTTAACCC 

CGACGCACACCACACGC 

CCAAGATCTGCACCTGC 

TTACCGCACTGGACGCC 

GCCAGAGGCTGTTCACC 

TGGGGAGGGAGCGAGCGGCG 

AAGGGCCCGGCTCGCGTCC 

AGGGCGGGGGGACGAACCGC 

TTAAACAGTCGGATTCCCCTGG 

TTCATCCATTCATGCGCG 

AGTAGTGGTATTTCACCGG 

ACGGGAGGTTTCTGTCC 

ACAATGATAGGAAGAGCCG 

AGGCGTTCAGTCATAATCCC 

TCCGCACCGGACCCCGGTCC 

GGGCTAGTTGATTCGGCAGGTGAGTTG 

TCTCCGGAATCGAACCCT 

ATT ACC GCGGCTGCTGGC 

TTGGCAAATGCTTTCGC 

CCGTCAATTCCTTTAAGTTTC 

AGGGCATCACAGACCTGTTAT 

CGACGGGCGGTGTGTAC 

CCGCAGGTTCACCTACGG 

-46- 



SEQ ID NO.: 5 
SEQIDNO.: 6 
SEQ ID NO.: 7 
SEQ ID NO.: 8 
SEQ ID NO.: 9 
SEQ ID NO.: 10 
SEQIDNO.: 11 
SEQIDNO.: 12 
SEQIDNO.: 13 
SEQIDNO.: 14 
SEQIDNO: 15 
SEQIDNO.: 16 
SEQIDNO.: 17 
SEQIDNO.: 18 
SEQIDNO.: 19 
SEQ ID NO.: 20 
SEQ ID NO.: 21 
SEQ ID NO.: 22 
SEQ ID NO.: 23 
SEQ ID NO.: 24 
SEQ ID NO.: 25 
SEQ ID NO.: 26 
SEQ ID NO.: 27 
SEQ ID NO.: 28 
SEQ ID NO.: 29 
SEQ ID NO.: 30 
SEQIDNO.: 31 
SEQIDNO.: 32 
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6.2. Results 

The results of these foot-printing experiments (see, e.g., Figures 2 to 6) 
indicated that the small molecules involved in modulating translation termination alter the 
accessibility of the chemical modifying agents to specific nucleotides in the 28S rRNA. 
More specifically, the regions protected by the small molecules include a conserved region 
in the vicinity of the peptidyl transferase center (domain V, see, e.g., Figures 2 and 3) 
implicated in peptide bond formation and a conserved region in domain H (see, e.g., Figures 
4 5 and 6) that may interact with the peptidyl transferase center based on binding of 
vemamycin B to both these areas (Vannuffel et al., 1994, Nucleic Acids Res. 
22(21):4449-53). 

9. EXAMPLE: HIMANDISEASE £ENEj> SO RTED BY CHROMOSO ME 



Table 2: 



Genes, Locations and Genetic Disorders on Chromosome 1 



Gene 

ABCA4 



ABCD3 



ACADM 



GDB Accession 
GDB.370748 



ID I OMIM Link 



GDB:131485 



GDB:118958 



MACULAR DEGENERATION, SENILE 
STARGARDT DISEASE 1; STGD1 ATP 
BINDING CASSETTE TRANSPORTER; 
ABCR RETINITIS PIGMENTOSA-19; 

RP19 

I PEROXISOMAL MEMBRANE PROTEIN 
1 1; PXMP1 



ACYL-CoA DEHYDROGENASE, 
| MEDIUM-CHAIN; ACADM 



25 AGL 



AGT 
ALDH4A1 



ALPL 



30 



AMPD1 



APOA2 



AVSD1 



35 



GDB: 132644 



GDB:1 18750 
GDB-.9958827 



GLYCOGEN STORAGE DISEASE IE 
ANGIOTENSIN I; AGT 
HYPERPROLINEMIA, TYPE II 



GDB: 118730 



I PHOSPHATASE, LIVER ALKALINE; 
ALPLHYPOPHOSPHATASIA, 

INFANTILE 



GDB:1 19677 

GDB:1 19685 
GDB:265302 



ADENOSINE MONOPHOSPHATE 
D EAMINASE- 1 ; AMPD1 

A POLIPOPROTEBSf A-n; APOA2 

ATRIOVENTRICULAR SEPTAL 
1 DEFECT; AVSD 
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( 


Gene 


GDB Accession ID | < 


DMIM Link 




BRCD2 


GDB:9955322 I ! 


BREAST CANCER, DUCTAL, 2; BRCD2 


5 


C1QA 


GDB:1 19042 1 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, ALPHA 
POLYPEPTIDE; C1QA 




C1QB 


GDB: 119043 1 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, BETA 
POLYPEPTIDE; C1QB 


10 


C1QG 


GDB:128132 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, GAMMA 
POLYPEPTIDE; C1QG 




C8A 


GDB:1 19735 1 


COMPLEMENT COMPONENT-8, 
DEFICIENCY OF 


15 


C8B 


GDB:119736 1 


COMPLEMfcrN 1 CUMrUlNriiN i-o, 
DEFICIENCY OF, TYPE II 


CACNA1S 


GDB: 126431 j 


«1T /-ITT T\ a OUT A XiXTIhT 

CALCIUM CHAIN JNriJj, 
VOLTAGE-DEPENDENT, L TYPE, 
ALPHA IS SUBUNIT; CACNA1 S 
PERIODIC PARALYSIS I MALIGNAN 1 
HYPERTHERMIA SUSCEPTIB1LITY-5; 
MHS5 


20 


1 ccv 


GDB: 1336655 


1 CATARACT, CONGENITAL, 
| VOLKMANN TYPE; CCV 




1 CD3Z 


GDB: 119766 


CD3Z ANTIGEN, ZETA POLYPEPTIDE; 
|CD3Z 




1 CDC2L1 


GDB: 127827 


| PROTEIN KINASE p58; PK58 


25 


1 PTTMT 


GDB:135222 


| CHOROIDEREMIA-LIKE; CHML 




1 CHSl 


GDB:4568202 


1 CHEDIAK-HJGASHI SYNDROME; 
CHSl 


30 


1 CIAS1 


GDB:9957338 


COLD HYPERSENSITIVITY 
URTICARIA, DEAFNESS, AND 
AMYLOIDOSIS 








1 CLCNKB 


GDB:698472 


CHLORIDE CHANNEL, KIDNEY, B; 
CLCNKB 


35 


1 CMD1A 


GDB:434478 


CARDIOMYOPATHY, DILATED 1A; 
CMD1A 
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t 



&Q39 



K .. 07"i2M-Oa 



1 


Gene 


GDB Accession ID 


OMBM Link j 


1 


CMH2 


GDB:137324 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 2; CMH2 


f 1 


l^iVJJVl 


GDB:1 19059 


MELANOMA, MALIGNANT 




COLl 1 Al 


GDB: 120595 


COLLAGEN, TYPE XI, ALPHA- 1; 
COL11A1 








10 


COL9A2 


GDB:138310 


COLLAGEN, TYPE IX, ALPHA-2 
CHAIN; COL9A2 EPIPHYSEAL 
DYSPLASIA, MULTIPLE, 2; EDM2 


15 

■ 


CPT2 


GDB:127272 


MYOPATHY WITH DEFICIENCY OF 
CARNITINE 

PALMITOYLTRANSFERASE II 
HYPOGLYCEMIA, HYPOKETOTIC, 
WITH DEFICIENCY OF CARNITINE 
PALMITOYLTRANSFERASE 
CARNITINE 

PALMITOYLTRANSFERASE II; CPT2 




CRB1 


GDB:333930 


RETINITIS PIGMENTOSA-12; RP12 








20 




GDB:596182 


CHOREOATHETOSIS/SPASTICITY, 
EPISODIC; CSE 






GDB: 126430 


COLONY STIMULATING FACTOR 3 
RECEPTOR, GRANULOCYTE; CSF3R 




1 PTT> A 

1 LlrA 


GDB:9863168 


CATARACT, POSTERIOR POLAR 






GDB:453910 


PYCNODYSOSTOSIS CATHEPSIN K; 
CTSK 




1 DBT 


GDB: 11 8784 


MAPLE SYRUP URINE DISEASE, TYPE 
2 


30 


Idioi 


GDB: 136449 


THYROXINE DEIODINASE TYPE I; 
TXDI1 


1 DISCI 


GDB:9992707 


DISORDER-2; SCZD2 








1 DPYD 


GDB:364102 


DIHYDROPYRIMIDINE 
DEHYDROGENASE; DPYD 


35 


Iekv 


GDB:1 19106 


ERYTHROKERATODERMA 
VARIABILIS; EKV 



-49- 



NY2- 1321355.1 



1 


Gene < 


GDB Accession ID 


0M1M Link 1 




ENOl 


GDB: 119871 


PHOSPHOPYRUVATE HYDRA 1 Abb, 1 
PPH 


5 


ENOIP 


GDB: 135006 


PHOSPHOPYRUVATE HYDRATASb, 
PPH 






GDB- 11 9865 


ERYTHROCYTE MEMBRANE 
PROTEIN BAND 4. 1 ; EPB41 
HEREDITARY HEMOLYTIC j 


10 


EPHXl 


GDB: 119876 


EPOXIDE HYDROLASE 1 , 
MICROSOMAL; EPHXl 1 




F13B 


GDB:1 19893 ! 


FACTOR Xffl, B SUBUNIT; Fl 3B 




rj 


GDB- 119896 


FACTOR V DEFICIENCY 






GDB: 119903 


Fc FRAGMENT OF IgG, LOW AFFINITY 
Ba, RECEPTOR FOR; FCGR2A 


15 


FCGR2B 


GDB:128183 


Fc FRAGMENT OF IgG, LOW AFFINITY 
Ha, RECEPTOR FOR; FCGR2A 






GDB: 11 9904 


Fc FRAGMENT OF IgG, LOW AFFINITY 
ma, RECEPTOR FOR; FCGR3 A 




FCHL 


GDB:9837503 


HYPERLIPEMIA, COMBINED 










FH 


GDB:1 19133 


FUMARATE HYDRATASE; FH j 
LEIOMYOMATA, HEREDITARY 
MULTIPLE, OF SKIN 


25 


r ivivyj 


GDB:135136 


FLAVIN-CONTAINING 
MONOOXYGENASE 3; FM03 j 
TRIMETHYLAMINURIA I 




FM04 


GDB: 127981 


FLAVIN-CONTAINING 1 
MONOOXYGENASE 2; FM02 I 






GDB:1 19237 


FUCOSIDOSIS 


30 


FY 


GDB: 11 9242 


BLOOD GROUP-DUFFY SYSTEM; Fy 




GALE 


GDB: 11 9245 


GALACTOSE EPIMERASE 
DEFICIENCY 




GBA 


GDB: 119262 


GAUCHER DISEASE, TYPE I; GD I J 









35 
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1 


Gene 1 


GDB Accession ID < 


OMIMLink 




GFND 


GDB:9958222 < 


GLOMERULAR NEPHRITIS, 
FAMILIAL, WITH FIBRONECTIN 
DEPOSITS 


5 - 


GJA8 


GDB-.696369 


CATARACT, ZONULAR 

nTTT T7T7T5T1T ETXTT 1 • (tAP 

PULVERULtiJN 1 1, v^Ziri vjt\r 

JUNCTION PROTEIN, ALPHA-8, 50-KD; 
GJA8 


10 


GJB3 


GDB: 127820 


ERYTHROKERATODERMIA 
VARIABILIS; EKV DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
2; DFNA2 ] 








GLC3B 


GDB.3801939 


GLAUCOMA 3, PRIMARY INFANTILE, 
B; GLC3B j 


15 


HF1 


GDB: 120041 


H FACTOR 1;HF1 1 




HMGCL 


GDB:1 38445 


HYDROXYMETHYLGLUTARICACIDU 
RIA; HMGCL 1 


20 


HPC1 


GDB:5215209 


PROSTATE CANCER; PRCA1 
PROSTATE CANCER, HEREDITARY 1 


HRD 


GDB:9862254 


HYPOPARATHYROIDISM WITH 
SHORT STATURE, MENTAL 
RETARDATION, AND SEIZURES 


25 


HRPT2 


GDB: 125253 


HYPERPARATHYROIDISM, FAMILIAL 
PRIMARY, WITH MULTIPLE 
OSSIFYING JAW _] 


HSD3B2 


GDB: 134044 


ADRENAL HYPERPLASIA n 1 


30 


HSPG2 


GDB:126372 


HEPARAN SULFATE I 
PROTEOGLYCAN OF BASEMENT 
MEMBRANE; HSPG2 MYOTONIC 
MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND 
OCULAR AND FACIAL 1 




KCNQ4 


GDB:439046 


DEAFNESS, AUTOSOMAL DOMINANT 1 
NONSYNDROMIC SENSORINEURAL, 
2; DFNA2 









35 
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Gene 


GDB Accession ID 


OMIM Link j 




KCS 


GDB.9848740 


KENNY-CAFFEY SYNDROME, 
RECESSIVE FORM | 


5 


KIF1B 


GDB: 128645 


CHARCOT-MARIE-TOOTH DISEASE, 
NEURONAL TYPE, A; CMT2A 










LAMB 3 


GDB.251820 


LAMININ, BETA 3; LAMB3 


10 j 


LAMC2 


GDB: 136225 


T a Tv/irNTTNr OA MM A 1- LAMC2 
EPIDERMOLYSIS BULLOSA LETALIS 




LGMD1B 


GDB :23 1606 


Ml T<JPT IT AR DYSTROPHY, 1 
LIMB-GIRDLE, TYPE IB; LGMD1B ) 




T H/fXTA 


GDB:132146 


LAMIN A/C; LMNA LIPODYSTROPHY, I 
T7AMTT TAT PARTIAL DUNNIGAN 
TYPE; LDP1 1 


15 


t r\x> 
LAJiv 


GDB:13204y 


T nRTPRTTsT- T OR 1 




MCKD1 


GDB:9859381 


POLYCYSTIC KIDNEYS, MEDULLARY 
TYPE 




MCL1 


GDB: 1391 37 


MYELOID CELL LEUKEMIA 1 ; MCL1 




MPZ 


GDB: 125266 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS MYELIN PROTEIN 
ZERO; MPZ | 




MTHFR 


GDB:370882 


5, 1 0-@METH YLENETETRAHYDROFO 1 
LATE REDUCTASE; MTHFR J 


25 


1 MTR 


GDB: 119440 


METHYLTETRAHYDROFOLATE:L-HO 1 
MOCYSTEINE 

S-METHYLTRANSFERASE; MTR 




Imutyh 


GDB:9315115 

• 


ADENOMATOUS POLYPOSIS OF THE 
COLON; APC J 








30 


Imyoc 


GDB:5584221 


GLAUCOMA 1, OPEN ANGLE; GLC1A 
MYOCILIN; MYOC ] 




| NB 


GDB:9958705 


NEUROBLASTOMA; NB J 




NCF2 


GDB:120223 


GRANULOMATOUS DISEASE, j 
CHRONIC, AUTOSOMAL 
CYTOCHROME-b-POSHTVE FORM j 



35 
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1 *\n*± 1 


GDB Accession ID < 


3MIM Link J 


Inemi < 


GDB: 127387 1 


vTEMALINE MYOPATHY 1 , 
AUTOSOMAL DOMINANT; NEM1 J 


I \TDUO 


GDB:9955617 


ARRHYTHMOGENIC RIGHT 
VENTRICULAR DYSPLASIA, 
FAMILIAL, 2; ARVD2 __J 






NPPA 1 


GDB: 11 8727 


xt a tr n TO FTTC PFPTIDE PRECURSOR 
A; NPPA 


1 NRAS 1 


GDB: 119457 | 


rvMrrvfrFiMF NRAS- NRAS* NRAS 1 1 


Intrki I 


GDB:127897 1 


ONCOGENE TRK NEUROTROPHIC 
TVPn<5TNF KINASE RECEPTOR, TYPE 
1 ; NTRK1 NEUROPATHY, j 
CONGENITAL SENSORY, WITH 

A "V TT TTT"Vrt /""> O TO 1 

ANHIDROSIS 1 


1 ADTAO 
1 UrlAZ 


GDB:9955577 1 


OSTEOPETROSIS*, AU I Uouivi/\l. 
DOMINANT, TYPE II; OPA2 J 


1 DDY1 

1 rrJAx 1 


GDB:125351 


PRE-B-CELL LEUKJiMlA 
TRANSCRIPTION FACTOR-1 ; PBX1 


1 DPUP 


| GDB.9955586 


PHEOCHROMOCYTOMA | 


PGD 


GDB: 119486 


| DEHYDROGENASE, ERYTHROCYTE 


1 "DXJ ATA 


GDB-9955628 


1 PSEUDOHYPOALDOSTERONISM, 

1 TYPE II; PHA2 _ J 


1 DTJ/TTYtJ 

1 rHOLixl 


1 GDB:9958261 


3-@PHOSPHOGLYCERATE 1 
| DEHYDROGENASE DEFICIENCY | 


Ipklr 


1 GDB:120294 


PYRUVATE KINASE DEFICIENCY OF 1 
1 ERYTHROCYTE 1 


1 PKP1 


| GDB:4249598 


PLAKOPHILIN 1; PKP1 1 


1 PLA2G2A 

0 


1 GDB: 120296 


1 PHOSPHOLIPASE A2, GROUP DA; 
PLA2G2A 


1 PLOD 


1 GDB: 127821 


1 PROCOLLAGEN-LYSINE, 1 
2-OXOGLUTARATE 5-DIOXYGENASE; 
PLOD EHLERS-DANLOS SYNDROME, 

| TYPE VI; E-D VI; EDS VI _l 


1 PPOX 
5 I 


1 GDB:1 18852 


PROTOPORPHYRINOGEN OXIDASE; 
1 PPOX 1 
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Gene 



GDB Accession ID 



OMIM Link 



PPT 



GDB: 125227 



CEROID-LIPOFUSCINOSIS, 
NEURONAL 1, INFANTILE; CLN1 
PALMITOYL-PROTEIN 
THIOESTERASE; PPT 



PRCC 



PRG4 



10 



GDB:3888215 



GDB:9955719 



PAPILLARY RENAL CELL 
CARCINOMA; PRCC 



ARTHROPATHY-CAMPTODACTYLY 
SYNDROME 



PSEN2 



PTOS1 



15 REN 



GDB:633044 



GDB:6279920 



GDB: 120345 



ALZHEIMER DISEASE, FAMILIAL, 
TYPE 4; AD4 



TOSIS, HEREDITARY CONGENITAL 
l;PTOSl 



RENIN; REN 



RFX5 



RHD 



RMD1 



20 



RPE65 



GDB:6288464 



GDB:119551 



GDB:448902 



GDB:226519 



REGULATORY FACTOR 5; RFX5 



RHESUS BLOOD GROUP, D ANTIGEN; 
RHD 



RIPPLING MUSCLE DISEASE-1; RMD1 



RETINAL PIGMENT 
EPITHELRJM-SPECIFIC PROTEIN, 
65-KD; RPE65 AMAUROSIS 
CONGENITA OF LEBER H 



SCCD 



25 



SERPINC1 



SJS1 



GDB-.9955558 



GDB: 11 9024 



GDB-.1381631 



CORNEAL DYSTROPHY, 
CRYSTALLINE, OF SCHNYDER 



ANTITHROMBIN UI DEFICIENCY 



MYOTONIC MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND 
OCULAR AND FACIAL 



SLC19A2 



30 



GDB:9837779 



THIAMINE-RESPONSIVE 
MEGALOBLASTIC ANEMIA 
SYNDROME 



SLC2A1 



GDB: 120627 



SOLUTE CARRIER FAMILY 2, 
MEMBER 1; SLC2A1 



35 
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20 



Gene ( 


GDB Accession ID | 


OMIM Link 1 


SPTA1 


GDB: 119601 


ELLIPTOCYTOSIS, 
RHESUS-UNLINKED TYPE 
HEREDITARY HEMUL Y 1 
SPECTRIN, ALPHA, ERYTHROCYTIC 

1;SPTA1 


TAT 1 1 


GDB: 120759 1 


T-CELL ACUTE LYMPHOCYTIC 
LEUKEMIA 1; TALI 


JLlNJrorv) i 


GDB:422178 


APOPTOSIS ANTIGEN LIG AND 1 ; 
APT1LG1 


TNTWT9. 


GDB-.221879 


TROPONIN -T2, CARDIAC; TNNT2 


I rlvi j 


GDB: 127872 


ONCOGENE TRK TROPOMYOSIN 3; 
TPM3 , 


TSHB 


GDB:120467 


THYROID-STIMULATING HORMONE, 
BETA CHAIN; TSHB 


UMPK 


nr>R- 190481 


URIDINE MONOPHOSPHATE KINASE; 
UMPK _j 


uox 


1 VJ J_/ H> . L £r J ~s*J s 


1 URATE OXIDASE; UOX 


UROD 


|GDB:119628 


PORPHYRIA CUTANEA TARDA; PCT 


> USH2A 


1 GDB: 120483 


1 T 1<3WFR SYNDROME. TYPE II; USH2 


VMGLOM 


| GDB:9958134 


1 GLOMUS TUMORS, MULTIPLE 


VWS 


1 GDB: 120532 


CLEFT LIP AND/OR PALATE WITH 
1 MUCOUS CYSTS OF LOWER LIP j 


WS2B 

5 


1 GDB.407579 


WAARDENBURG SYNDROME, TYPE 
2B; WS2B 



Table 3: Genes, Locations and Genetic Disorders on Chromosome 2 



Gene 


GDB Accession ID 


Location 


OMIM Link 


ABCB11 


GDB:9864786 


2q24-2q24 
2q24.3-2q24.3 


CHOLESTASIS, PROGRESSIVE 
FAMILIAL INTRAHEPATIC 2; 
PFIC2 


ABCG5 


GDB:10450298 


2p21-2p21 


PHYTOSTEROLEMIA 


ABCG8 


GDB: 10450300 


2p21-2p21 


PHYTOSTEROLEMIA 


ACADL 


GDB: 11 8745 


2q34-2q35 


ACYL-CoA 
DEHYDROGENASE, 
LONG-CHAIN, DEFICIENCY OF 



30 



35 
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1 4 




jDB Accession ID I 


vocation | < 


)MIM Link 


|i 


\CP1 |< 


3DB: 118962 h 


> P 25-2p25 jj 


>HOSPHATASE, ACID, OF 
iRYTHROCYTE; ACPI 


5 L 


AGXT 


3DB:127113 1: 


Jq37.3-2q37.3 |< 


OXALOSIS I 




AHHR ( 


GDB: 11 8984 1: 


2pter-2q31 < 

1 


CYTOCHROME P450, 
SUBFAMILY I, POLYPEPTIDE 
1;CYP1A1 


10 

l 


ALMS1 1 


GDB:9865539 J 


2pl3-2pl2 1 

2pl4-2pl3 

2pl3.1-2pl3.1 


ALSTROM SYNDROME 




ALPP 1 


GDB:119672 1 


2q37.1-2q37.1 1 


ALKALINE PHOSPHATASE, 
PLACENTAL; ALPP 




ALS2 1 


GDB: 135696 1 


2q33-2q35 1 


AMYOTROPHIC LATERAL 
SCLEROSIS 2, JUVENILE; ALS2 


15 


APOB 1 


GDB: 119686 ! 


2p24-2p23 
2p24-2p24 


APOLIPOPROTEIN B; APOB 




BDE 


GDB:9955730 


2q37-2q37 


BRACHYDACTYLY, TYPE E; 
BDE 


20 


Ibdmr 


|GDB:533064 


2q37-2q37 


1 BRACHYDACTYLY-MENTAL 
RETARDATION SYNDROME; 
|BDMR 




Ibjs 


|GDB:9955717 


|2q34-2q36 


1 TORTI AND NERVE 
1 DEAFNESS 


25 


IBMPR2 


|GDB:642243 


|2q33-2q33 
2q33-2q34 


1 PULMONARY 

HYPERTENSION, PRIMARY; 

PPH1 BONE MORPHOGENETIC 
1 RECEPTOR TYPE II; BMPR2 




CHRNA1 


GDB:120586 


|2q24-2q32 


CHOLINERGIC RECEPTOR, 
NICOTINIC, ALPHA 
| POLYPEPTIDE 1; CHRNA1 


30 


CMCWTD 


1 GDB: 11498919 


|2p22.3-2p21 


1 FAMILIAL CHRONIC 
MUCOCUTANEOUS, 
1 DOMINANT TYPE 




CNGA3 


IgDB:434398 


|2qll.2-2qll.2 


COLORBLINDNESS, TOTAL 
CYCLIC NUCLEOTIDE GATED 
CHANNEL, OLFACTORY, 3; 
CNG3 


35 
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Gene 



GDB Accession ID Location 



OMIMLink 



ICOL3A1 |GDB:118729 2q31-2q32.3 COLLAGEN [ TYPE ffl; COL3 Al 

1 f 2q32.2-2q32.2 I EHLERS-DANLOS 

1 SYNDROME, TYPE IV, 

AUTOSOMAL DOMINANT 



COL4A3 



GDB:128351 



1 2q36-2q37 COLLAGEN, TYPE IV, 

lALPHA-3 CHAIN; COL4A3 



COL4A4 GDB: 132673 



1 2q35-2q37 COLLAGEN, TYPE IV, 
1 ALPHA-4 CHAIN; COL4A4 



■ 0 ICOUA3 |GDB:,.9066 ^^h!^COL5A3 

MYOPATHY, BENIGN 
CONGENITAL, WITH 
CONTRACTURES 



CPS1 



GDB: 119799 



2q33-2q36 
2q34-2q35 
2q35-2q35 



HYPERAMMONEMIA DUE TO 
CARBAMOYLPHOSPHATE 
SYNTHETASE I DEFICIENCY 



CRYGA 



I GDB- 1 1 9076 1 2q33-2q35 CRYSTALLIN, GAMMA A; 

CRYGA 



CRYGEP1 GDB:1 19808 



1 2q33-2q35 CRYSTALLIN, GAMMA A; 
' " CRYGA 



20 CYP1B1 GDB:353515 



1 2p21 -2p21 GLAUCOMA 3, PRIMARY 
2p22-2p21 INFANTILE, A; GLC3A 
2pter-2qter CYTOCHROME P450, 
SUBFAMILY I 
(DIOXIN-INDUCIBLE), 
POLYPEPTIDE 1 ; CYP1B1 



CYP27A1 GDB: 128129 



|2q33-2qter CEREBROTENDINOUS 
XANTHOMATOSIS 



DBI 



GDB: 119837 



|2ql2-2q21 DIAZEPAM BINDING 
' * INHIBITOR; DBI 



DES 



GDB: 11 9841 



2q35-2q35 DESMIN; DES 



DYSF 



GDB:340831 



2p-2p 

2pl3-2pl3 

2pter-2pl2 



EDAR 



GDB:9837372 



2qll-2ql3 



MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2B; 
LGMD2B MUSCULAR 
DYSTROPHY, LATE-ONSET 
DISTAL 



DYSPLASIA, HYPOHIDROTIC 
ECTODERMAL DYSPLASIA, 
ANHIDROTIC 
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Gene 



GDB Accession ID Location 



OMIM Link 



EFEMP1 



GDB: 12201 11 



2pl6-2pl6 



DOYNE HONEYCOMB 
DEGENERATION OF RETINA 
FIBRILLIN-LIKE; FBNL 



EEF2AK3 



GDB:9956743 



2pl2-2pl2 



EPIPHYSEAL DYSPLASIA, 
MULTIPLE, WITH 
EARLY-ONSET DIABETES 
MELLITUS 



ERCC3 



GDB:1 19881 



2q21-2q21 



EXCISION-REPAIR, 
COMPLEMENTING 
DEFECHVE, IN CHINESE 
HAMSTER, 3; ERCC3 



FSHR 



GDB: 1275 10 



2p21-2pl6 



FOLLICLE-STIMULATING 
HORMONE RECEPTOR; FSHR 
GONADAL DYSGENESIS, XX 
TYPE 



GAD1 



GDB: 119244 



2q31-2q31 



PYRIDOXINE DEPENDENCY 
WITH SEIZURES 



GINGF 



GDB:9848875 



2p21-2p21 



GINGIVAL SON OF 
SEVENLESS (DROSOPHILA) 
HOMOLOG 1; SOS1 



GLC1B 



GDB-.1297553 



2ql-2ql3 



GLAUCOMA 1 , OPEN ANGLE, 
B; GLC1B 



GPD2 



GDB:354558 



2q24.1-2q24.1 



GLYCEROL-3-PHOSPHATE 
DEH YDROGENASE-2 ; GPD2 



GYPC 



GDB: 120027 



2ql4-2q21 



BLOOD GROUP-GERBICH; Ge 



HADHA 



GDB:434026 



2p23-2p23 



HYDROXYACYL-CoA 
DEHYDROGENASE/3-KETOAC 

YL-CoA 

TfflOLASE/ENOYL-CoA 
HYDRATASE, 



HADHB 



HOXD13 



GDB:344953 



2p23-2p23 



HYDROXYACYL-CoA 
DEHYDROGENASE/3-KETOAC 

YL-CoA 

THIOLASE/ENOYL-CoA 
HYDRATASE, 



GDB: 127225 



2q31-2q31 



HOMEO BOX-D13; HOXD13 
SYNDACTYLY, TYPE II 



HPE2 



GDB:136066 



2p21-2p21 



MIDLINE CLEFT SYNDROME 



-58- 
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Gene 


GDB Accession ID 


vocation 


OMIM Link 






2nl2-2nl2 
2pll.2-2pll.2 


IMMUNOGLOBULIN KAPPA 
CONSTANT REGION; IGKC 


IHH 


GDB:511203 


2q33-2q35 
2q35-2q35 
2pter-2qter 


BRACHYDACTYLY, TYPE Al; 
nn a i TXmT AM TTFnOFHOG 

DROSOPHELA, HOMOLOG OF; 

IHH 


IRS1 


GDB: 133974 


2q36-2q36 


INSULIN RECEPTOR 
SUBSTRATE 1; IRbl 


ITGA6 


GDB: 128027 


2pter-2qter 


INTEGRIN, ALPHA-6; ITGA6 


KHK 


GDB:391903 


2pzi.i-zpzJ.z 


T7 P T Tr'TO^lT TO T A 


KYNU 


GDB:9957925 j 


2q22.2-2q23.3 




LCI 


<TTYR«1 901 4.0 
\JLsD , 1 Z.U 1 *tU 


2a21-2a21 


DISACCHARIDE 
INTOLERANCE n 




nnn- 19^9^0 


2r>21-2o21 


LUTEINIZING 

HORMONE/CHORIOGONADOT 
ROPIN RECEPTOR; LHCGR 


LSFC 


GDB:9956219 


2-2 2pl6-2pl6 


CYTOCHROME c OXIDASE 
DEFICIENCY, 

FRENCH-CANADIAN TYPE 


MSH2 


GDB:203983 


2pl6-2pl6 
2p22-2p21 


COLON CANCER, FAMILIAL, 
NONPOLYPOSIS TYPE 1; FCC1 


MSH6 


GDB:632803 


2pl6-2pl6 


G/T MISMATCH-BINDING 
PROTEIN; GTBP 


NEB 


GDB: 120224 


2q24.1-2q24.2 


NEBULIN; NEB NEMALINE 
MYOPATHY 2, AUTOSOMAL 
RECESSIVE; NEM2 


NMTC 






THYROID CARCINOMA, 
PAPILLARY 


NPHP1 


GDB: 128050 


2ql3-2ql3 


NEPHRONOPHTHISIS, 
FAMILIAL JUVENILE 1 ; NPHP1 


PAFAH1P1 


GDB:435099 


2pll.2-2pll.2 


PLATELET-ACTIVATING 
FACTOR 

ACETYLHYDROLASE, 
GAMMA SUBUNIT 


PAX3 


GDB: 120495 


2q36-2q36 
2q35-2q35 


KLEIN- WAARDENBURG 
SYNDROME WAARDENBURG 
SYNDROME; WS1 
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Gene J ( 


JDB Accession ID I 


.location | ( 


)MIM Link 


PAX8 |( 


3DB: 136447 1 


iql2-2ql4 |l 
|< 


'AIRED BOX HOMEOTIC 
3ENE 8; PAX8 


jpMSl |( 


3DB:386403 1: 


>q31-2q33 |l 
1 

|< 


'OSTMEIOTIC SEGREGATION 
NCREASED (S. 
DEREVISIAE)- 1 ; PMS 1 


Ipnkd l< 


3DB:5583973 V 


2q33-2q35 V 
J 


3HOREOATHETOSIS, 
FAMILIAL PAROXYSMAL; 
FPD1 


Ipphi I 


GDB:1381541 1 


2q31-2q32 
2q33-2q33 


PULMONARY 

HYPERTENSION, PRIMARY; 
PPH1 


Iproc I 


GDB:120317 1 


2ql3-2q21 I 
2ql3-2ql4 


PROTEIN C DEFICIENCY, 
CONGENITAL THROMBOTIC 
DISEASE DUE TO 


Iregia I 


GDB:132455 i 


2pl2-2pl2 1 


REGENERATING 
ISLET-DERIVED 1 -ALPHA; 
REG1A 


|SAG 


GDB:120365 


|2q37.1-2q37.1 


S-ANTIGEN; SAG 


IsFTPB 


GDB: 120374 


2pl2-2pll.2 


SURFAL 1 ATM 1 -Aoovj^ia i 
PROTEIN, PULMON ARY-3 ; 
SFTP3 I 


IsLCllAl 


lGDB:371444 


2q35-2q35 


I /-<tt>dxj/~»otc T>l?TNyl AT?V* PRP 
1 CIRKJtiUMo, r KlJVIAiv. I > rDKs 

NATURAL 

RESISTANCE-ASSOCIATED 
MACROPHAGE PROTEIN 1; 
|NRAMP1 


' ISLC3A1 


IgDB:202968 


|2pl6.3-2pl6.3 
2p21-2p21 


SOLUTE CARRIER FAMILY 3, ! 
MEMBER 1;SLC3A1 
1 CYSTINURIA; CSNU 


Isosi 


IgDB:230004 


|2p22-2p21 


1 GINGIVAL SON OF 
SEVENLESS (DROSOPHILA) 
1 univtnT on 1 • SOSl 


} ISPG4 


|GDB:230127 


|2p24-2p21 


I SPASTIC PARAPLEGI A-4, 
AUTOSOMAL DOMINANT; 
|SPG4 


1 SRD5A2 


1 GDB: 127343 


|2p23-2p23 


I PSEUDOVAGINAL 
PERINEOSCROTAL 
HYPOSPADIAS; PPSH 
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Gene 


uUd Accession iu 


.Location 


tYMTM Link 


TCL4 


GDB: 136378 


2q34-2q34 


T-CELL 

T FT TJCFMrA/LYMPHOMA^* 
TCL4 


TGFA 


GDB: 120435 


2pl3-2pl3 


TRANSFORMING GROWTH 

FArTOR AT PT-TA* TfrFA 


TMD 


GDB:9837196 


2q31-2q31 


TIBIAL MUSCULAR 
DYSTROPHY, TARDIVE 


TPO 


GDB: 120446 


2p25-2p25 
2p25-2p24 


THYROID 

HORMONOGENESIS, GENETIC 

TYRFPr'T TIM TTA 


UGT1 


GDB:120007 


2q37-2q37 


UDP 

OT TTCTTRONOSYLTRANSFERA 
SE 1 FAMILY, Al; UGT1A1 


UV24 


GDB:9955737 


2pter-2qter 


UV-DAMAGE, EXCISION 
REPAIR OF, UV-24 


WSS 


GDB:9955707 


2q32-2q32 


WRINKLY SKIN SYNDROME; 
WSS 


XDH 


GDB:266386 


2p23-2p22 


XANTHINURIA 


ZAP70 


GDB:433738 


2qll-2ql3 
2ql2-2ql2 


SYK-RELATED TYROSINE 
KINASE; SRK 


ZFHX1B 


GDB:9958310 


2q22-2q22 


DISEASE, MICROCEPHALY, 
AND IRIS COLOBOMA 



Table 4: Genes, Locations and Genetic Disorders on Chromosome 3 
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Gene 


GDB Accession ID 


Location 


OMIM Link 


ACAA1 


GDB:1 19643 


3p23-3p22 


PEROXISOMAL 
3-OXOACYL-COENZYME A 
THIOLASE DEFICIENCY 


AGTR1 


GDB: 132359 


3q21-3q25 


ANGIOTENSIN H RECEPTOR, 
VASCULAR TYPE 1; AT2R1 


AHSG 


GDB: 11 8985 


3q27-3q27 


ALPHA-2-HS-GLYCOPROTEIN; 
AHSG 


AMT 


GDB: 1321 38 


3p21.3-3p21.2 
3p21.2-3p21.1 


HYPERGLYCINEMIA, ISOLATED 
NONKETOTIC, TYPE II; NKH2 


ARP 


GDB:9959049 


3p21.1-3p21.1 


ARGINDME-RICH PROTEIN 


BBS3 


GDB:376501 


3p-3p 

3pl2.3-3qll.l 


BARDET-BIEDL SYNDROME, 
TYPE3;BBS3 
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Gene 


GDB Accession ID 


Location 


OMEM Link 




BCHE 


GDB: 120558 


3q26.1-3q26.2 


BUTYRYLCHOLINESTERASE; 


5 


BCPM 


GDB:433809 


3q21-3q21 


BENIGN CHRONIC PEMPHIGUS; 




BTD 


GDB:309078 


3p25-3p25 


BIOTBSTCDASE; BTD 


10 


CASR 


GDB:134196 


3q21-3q24 


HYPOCALCIURIC 

HHC1 


CCR2 


GDB:337364 


3p21-3p21 


CHEMOKINE (C-C) RECEPTOR 2; 
CMK13K2 




CCR5 


GDB:1230510 


3p21-3p21 


CHEMOKINE (C-C) RECEPTOR 5; 
CMivbrO 


15 


CDL1 


GDB: 136344 


3q26.3-3q26.3 


DE LANGE SYNDROME; CDL 


CMT2B 


GDB:604021 


3ql3-3q22 


CHARCOT-MARIE-TOOTH 
DISEASE, NEURONAL TYPE, B; 
CMT2B 




COL7A1 


GDB: 128750 


3p21-3p21 
3p21.3-3p21.3 


COLLAGEN, TYPE VII, ALPHA- 1; 
COL7A1 


20 


CP 


GDB: 119069 


3q23-3q25 
3q21-3q24 


CERULOPLASMEST; CP 




CRV 


GDB: 11498333 


3p21.3-3p21.1 


VASCULOPATHY, RETINAL, 
WITH CEREBRAL 


25 


CTNNB1 


GDB:141922 


3p22-3p22 
3p21.3-3p21.3 


CATENIN, BETA 1; CTNNB1 




DEM 


GDB:681157 


3pl2-3qll 


DEMENTIA, FAMILIAL 
NONSPECIFIC; DbM 




ETM1 


GDB:9732523 


3ql3-3ql3 


TREMOR HEREDITARY 
ESSENTIAL 1;ETM1 


30 


FANCD2 


GDB:698345 


3p25.3-3p25.3 
3pter-3p24.2 


FANCONI PANCYTOPENIA, 
COMPLEMENTATION GROUP D 




FIH 


GDB:9955790 


3ql3-3ql3 


HYPOPARATHYROIDISM, 
FAMILIAL ISOLATED; FIH 


35 


FOXL2 


GDB:129025 


3q23-3q23 
3q22-3q23 


BLEPHAROPHIMOSIS, 
EPICANTHUS INVERSUS, AND 
PTOSIS; BPES 



15 



20 



30 



Gene 


GDB Accession ID 


Location 


OMIM Link 


GBE1 


GDB:138442 


3pl2-3pl2 


GLYCOGEN STORAGE DISEASE 
IV 


GLB1 


GDB: 119987 


3p22-3p21.33 ! 
3p21.33-3p21.33 


GANGLIOSIDOSIS, 
GENERALIZED GMl, TYPE I 


GLC1C 


GDB.-3801941 


3q21-3q24 j 


GLAUCOMA l, OPEN ANGLE, C; 
GLC1C 


GNAI2 


GDB:120516 


3p2l.3-3p2l.2 


/~*T T A XTTX1T? 

GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-INHIBITING, 
POLYPEPTIDE-2; 


GNAT1 


GDB: 119277 


3p2l.3-3p2l.2 


GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, 

ALPHA-TRANSDUCING, 
POLYPEPTIDE 


GP9 


GDB:126370 


3pter-3qter 


PLATELET GLYCOPROTEIN DC; 
GP9 


GPX1 


GDB:1 19282 


3qll-3ql2 
3p2l.3-3p2l.3 


GLUTATHIONE PEROXIDASE; 
GPXl 


HGD 


GDB:203935 


3q2l-3q23 


ALKAPTONURIA; AKU 


HRG 


GDB: 120055 


3q27-3q27 


HISTIDINE-RICH 
GLYCOPROTEIN; HRG; HRGP 


1T1H1 


GDB:120107 


3p2l.2-3p2l.l 


INTER-ALPHA-TRYPSIN 
INHIBITOR, HEAVY CHAIN-l; 
ITIHl;IATIH; ITIH 


KNG 


GDB:125256 


3q27-3q27 


FLAUJEAC FACTOR 
DEFICIENCY 


LPP 


GDB: 1391795 


3q27-3q28 


LIM DOMAIN-CONTAINING 
PREFERRED TRANSLOCATION 
PARTNER IN LIPOMA; LPP 


LRS1 


GDB:682448 


3p2l.l-3pl4.l 


LARSEN SYNDROME, 
AUTOSOMAL DOMINANT; LRSl 


MCCC1 


GDB: 135989 


3q27-3q27 
3q25-3q27 


BETA-METHYLCROTONYLGLYC 
INURIAI 


MDS1 


GDB:250411 


3q26-3q26 


MYELODYSPLASIA SYNDROME 
l;MDSl 



35 
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GDB Accession ID ] 


Location < 


OMIM Link 




MHS4 


GDB:574245 


3ql3.1-3ql3.1 


:i YPEK 1 ribKMlA 
SUSCEPTroiLITY-4; MHS4 


5 


MTTF 


GDB:214776 


3pl4.1-3pl2 


MICROPHTHALMIA- ASSOCIATE 
J 1 KANoCKlr 1 1UJN r Al_. 1 UK., 
MTTF WAARDENBURG 
SYNDROME, TYPE II; WS2 




MLH1 


GDB:249617 


3p23-3p22 
3p21.3-3p21.3 


COLON CANCER, FAMILIAL, 
NONPOLYPOSIS TYPE 2; FCC2 


10 


MYL3 


GDB: 12021 8 


3p21.3-3p21.2 


MYOSIN, LIGHT CHAIN, ALKALI, 
VENTRICULAR AND SKELETAL 
SLOW; MYL3 j 




MYMY 


GDB: 11500610 


3p26-3p24.2 


DISEASE 




0PA1 


GDB: 118848 


3q28-3q29 


OPTIC ATROPHY 1; OPA1 


15 


PBXP1 


GDB: 125352 


3q22-3q23 j 


PRE-B-CELL LEUKEMIA ! 
TRANSCRIPTION FACTOR-1; 
PBX1 




PCCB 


GDB: 119474 


3q21-3q22 


PT "vr/TXTTTlV AT A VETnTTP TT 

GLYCiNriMlA, JsJd 1U1 iC, 11 


20 


POU1F1 


GDB: 129070 


3pll-3pll 


POU DOMAIN, CLASS 1, 
lKANoCKlr 11UIN rALIUR 1, 
POU1F1 




PPARG 


GDB:1223810 


3p25-3p25 


CANCER OF COLON 
PEROXISOME PROLIFERATOR 
ACTIVATED KECbr 1 UK, 
GAMMA; PPARG 


25 


PROS1 


GDB: 120721 


3pll-3qll 
3pll.l-3qll.2 


TV!"* rv'PL'lKT O AT TiT T A . "DO /^O 1 

PROTEIN S, ALPHA; rKUM 




PTHR1 


GDB:138128 


3p22-3p21.1 


METAPHYSEAL 
CHONDRODYSPLASIA, MURK 
JANSEN TYPE PARATHYROID 
HORMONE RECEPTOR 1; PTHR1 


30 


RCA1 


GDB:230233 


3pl4.2-3pl4.2 


RENAL CARCINOMA, r AM1L1AJL, 
ASSOCIATED 1;RCA1 




RHO 


GDB: 120347 


3q21.3-3q24 


RHODOPSIN; RHO 




SCA7 


GDB:454471 


3p21.1-3pl2 


SPINOCEREBELLAR ATAXIA 7; 
SCA7 


35 


SCLC1 


GDB:9955750 


3p23-3p21 


SMALL-CELL CANCER OF THE 
LUNG; SCCL 



-64- NY2- 1321355.1 



J 

( 








0MIM Link 




SCN5A 


GDB: 132152 


Jp21-3p21 


SODIUM CHANNEL, 
VOLTAGE-GATED, TYPE V, 
ALPHA POLYPEPTIDE; SCN5A 


5 


SI 


GDB: 120377 


3q25.2-3q26.2 


DISACCHARIDE INTOLERANCE I 




SLC25A20 


GDB:6503297 


3p21.31-3p21.31 


CARNITINE-ACYLCARNrriNE 
TRANSLOCASE; CACT 


1 0 
1 yJ 


SLC2A2 


GDB-1 19995 


3q26.2-3q27 
3q26.1-3q26.3 


SOLUTE CARRIER FAMILY 2, 
MEMBER 2; SLC2A2 
FANCONI-BICKEL SYNDROME; \ 
FBS 




lr 






TRANSFERRIN; TF 


15 


TGFBR2 


GDB:224909 


3p22-3p22 


TRANSFORMING GROWTH 
FACTOR-BETA RECEPTOR 
TYPE H; TGFBR2 




frTYR^ 74007 




THROMBOPOIETIN: THPO 




THRB 


GDB:120731 


3p24.1-3p22 
ln24 3-3t>24 3 


THYROID HORMONE 
RECEPTOR BETA; THRB 


20 


1K.1 




JU 1 ".J J LJ 1*T.-J 


WERNICKE-KORSAKOFF 
SYNDROME 


TM4SF1 


GDB:250815 


3q21-3q25 


TUMOR-ASSOCIATED ANTIGEN 
L6- TAAL6 




TRH 


GDB:128072 


3pter-3qter 


THYROTROPIN-RELEASING 
HORMONE DEFICIENCY 




UMPS 


GDB:120482 


3ql3-3ql3 


OROTICACIDURIA I 


UQCRC1 


GDB:141850 


3p21.3-3p21.2 
3p21.3-3p21.3 


UBIQUINOL-CYTOCHROME c 
REDUCTASE CORE PROTEIN I; 
UQCRC1 




USH3A 


GDB:392645 


3q21-3q25 


USHER SYNDROME, TYPE III; 
USH3 


30 


VHL 


GDB: 120488 


3p26-3p25 


VON HIPPEL-LEMDAU 
SYNDROME; VHL 




WS2A 


GDB: 128053 


3pl4.2-3pl3 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MITF WAARDENBURG 
SYNDROME, TYPE II; WS2 



35 
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Gene 


GDB Accession Ixf 


Location 


OA/ITIVf T Ink 

WlV.ll.iYJL JLflUH. 


XPC 


GDB:134769 


3p25.1-3p25.1 


XERODERMA PIGMENTOSUM, 
COMPLEMENTATION GROUP C; 
XPC 


ZNF35 


GDB: 120507 


3p21-3p21 


ZINC FINGER PROTEIN-35; 
ZNF35 



Table 5: Genes, Locations and Genetic Disorders on Chromosome 4 





icne 


GDB Accession ID 


vocation 


OMIMLink 


10 


ADH1B 


GDB:1 19651 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-2; ADH2 




ADH1C 


GDB:1 19652 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-3; ADH3 




AFP 


GDB:1 19660 


4qll-4ql3 


ALPHA-FETOPROTEIN; AFP 


15 


AGA 


GDB:1 18981 


4q23-4q35 
4q32-4q33 


ASPARTYLGLUCOSAMINURIA 
;AGU 




AIH2 


GDB:1 18751 


4qll-4ql3 
4ql3.3-4q21.2 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL, 
AUTOSOMAL DOMINANT; 




ALB 


GDB:1 18990 


4qll-4ql3 


ALBUMIN; ALB 


20 


ASMD 


GDB: 11 9705 


4q-4q4q28-4q31 


ANTERIOR SEGMENT OCULAR 
DYSGENESIS; ASOD 




BFHD 


GDB: 11498907 


4q34.1-4q35 


DYSPLASIA, BEUKES TYPE 


25 


CNGA1 


GDB: 127557 


4pl4-4ql3 


CYCLIC NUCLEOTIDE GATED 
CHANNEL, PHOTORECEPTOR, 
cGMP GATED, 1; CNCG1 


CRBM 


GDB:9958132 


4pl6.3-4pl6.3 


CHERUBISM 




DCK 


GDB:126810 


4ql3.3-4q21.l 


DEOXYCYTIDINE KINASE; 
DCK 


30 


DFNA6 


GDB:636175 


4pl6.3-4pl6.3 


DEAFNESS, AUTOSOMAL 
DOMINANT NONS YNDROMIC 
SENSORINEURAL, 6; DFNA6 




DSPP 


GDB:5560457 


4pter-4qter 
4q21.3-4q21.3 


DENTIN PHOSPHOPROTEIN; 
DPP DENTINOGENESIS 
IMPERFECTA; DGI1 




DTDP2 


GDB:9955810 


4q-4q 


DENTIN DYSPLASIA, TYPE II 


35 


ELONG 


GDB:1 1498700 


4q24-4q24 





- 66 - NY2- 1321355.1 



Gene 



GDB Accession ID (Location 



lOMIM Link 



ENAM GDB:9955259 



4q21-4q21 



AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL, 
AUTOSOMAL DOMINANT; 
AMELOGENESIS IMPERFECTA, ] 
HYPOPLASTIC TYPE 



ETFDH GDB: 135992 



4q32-4q35 



GLUTARICACIDURIA BC; GA 

nc 



EVC 



GDB:555573 



4pl6-4pl6 



ELLIS-VAN CREVELD 
SYNDROME; EVC 



10 



Fll 



IGDB: 119891 



4q35-4q35 



PTA DEFICIENCY 



FABP2 



15 



FGA 



GDB:1 19127 



4q28-4q31 



T ATTY ACID BINDING 
IPROTEIN 2, INTESTINAL; 
FABP2 



GDB:1 19129 



4q28-4q28 



.OIDOSIS, FAMILIAL 
IVISCERAL FIBRINOGEN, A 
ALPHA POLYPEPTIDE; FGA 



FGB 



GDB:1 19130 



4q28-4q28 



FIBRINOGEN, B BETA 
POLYPEPTIDE; FGB 



FGFR3 



20 



GDB:127526 



4pl6.3-4pl6.3 



ACHONDROPLASIA; ACH 
IBLADDER CANCER 
FIBROBLAST GROWTH 
FACTOR RECEPTOR-3; FGFR3 



FGG 



GDB:1 19132 



4q28-4q28 



FIBRINOGEN, G GAMMA 
POLYPEPTIDE; FGG 



FSHMD1A GDB:1 19914 



4q35-4q35 



25 



FACIOSCAPULOHUMERAL 
MUSCULAR DYSTROPHY 1A; 
FSHMD1A 



GC 



GDB: 119263 



4ql2-4ql3 
4ql2-4ql2 



GROUP-SPECIFIC 
COMPONENT; GC 



GNPTA GDB: 1 1 9280 



4q21-4q23 



MUCOLIPIDOSIS n; ML2; ML B 



30 



GNRHR GDB: 136456 



4ql3-4ql3 
i21.2-4q21.2 



GONADOTROPIN-RELEASING 
HORMONE RECEPTOR; GNRHRl 



GYPA GDB: 118890 



|4q28-4q31 BLOOD GROUP-MN LOCUS; 
4q28.2-4q31.1 MM 



HCA 



GDB:9954675 4q33-4qter 



HYPERCALCIURIA, FAMILIAL 
IDIOPATHIC 



35 HCL2 GDB: 119305 I4q28-4q3 1 4q-4g |HAIR COLOR-2; HCL2 



-67- NY2-J3213SS.I 



2 15 £5 .OV2tgJ 



Gene 


GDB Accession ID 


Location 


OMIM Link 


HD 


GDB:1 19307 


4pl6.3-4pl6.3 


HUNTINGTON DISEASE; HD 


HTN3 


GDB: 125601 


4ql2-4q21 


HISTATIN-3; HTN3 


HVBS6 


GDB:120687 


4q32-4q32 


HEPATOCELLULAR 
CARCINOMA-2; HCC2 


IDUA 


GDB: 119327 


4pl6.3-4pl6.3 


MUCOPOLYSACCHARIDOSIS 
TYPE I; MPS I 


IF 


GDB: 120077 


4q24-4q25 
4q25-4q25 


COMPLEMENT COMPONENT-3 
INACTTVATOR, DEFICIENCY 
OF 


JPD 


GDB.120113 


4pter-4qter 
4ql2-4ql3 


PERIODONTITIS, JUVENILE; 
JPD 


KIT 


GDB: 1201 17 


4ql2-4ql2 


V-KIT HARDY-ZUCKERMAN 4 
FELINE SARCOMA VIRAL 
ONCOGENE HOMOLOG; KIT 


KLKB1 


GDB: 127575 


4q34-4q35 
4q35-4q35 


FLETCHER FACTOR 
DEFICIENCY 


LQT4 


GDB-.682072 


4q25-4q27 


SYNDROME WITHOUT 

PSYCHOMOTOR 

RETARDATION 


MANBA 


GDB: 125261 


4q21-4q25 


MANNOSIDOSIS, BETA; 
MANB1 


MLLT2 


GDB.136792 


4q21-4q21 


MYELOID/LYMPHOID OR 
MIXED LINEAGE LEUKEMIA, 
TRANSLOCATED TO, 2; MLLT2 


MSX1 


GDB: 120683 


4pl6.3-4pl6.1 
4pl6.1-4pl6.1 


MSH, DROSOPHILA, HOMEO 
BOX, HOMOLOG OF, 1; MSX1 


MTP 


GDB:228961 


4q24-4q24 


MICROSOMAL TRIGLYCERIDE 
TRANSFER PROTEIN, 88 KD; 
MTP 


NR3C2 


GDB:120188 


4q31-4q31 
4q3l.l-4q31.1 


PSEUDOHYPOALDOSTERONIS 
M, TYPE I, AUTOSOMAL 
RECESSIVE; PHA1 


PBT 


GDB: 120260 


4ql2-4q21 


PIEBALD TRAIT; PBT 



35 
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Gene 


GDB Accession ID 


Location 


OMIM Link 


5 


PDE6B 


GDB:125915 


4pl6.3-4pl6.3 


NIGHTBLINDNESS, 
CONGENITAL STATIONARY; 
CSNB3 PHOSPHODIESTERASE 
6B, cGMP-SPECIFIC, ROD, 
BETA; PDE6B 




PEE1 


GDB:7016765 


4q31-4q34 
4q25-4qter 


1; PEE1 


10 


PITX2 


GDB:134770 


4q25-4q27 
4q25-4q26 
4q25-4q25 


IRIDOGONIODYSGENESIS, 
TYPE 2; IRID2 RIEGER 
SYNDROME, TYPE 1; RIEG1 
RffiG BICOID-RELATED 
HOMEOBOX TRANSCRIPTION 
FACTOR 1; RIEG1 HOMEO BOX 


15 


PKD2 


GDB: 118851 


4q21-4q23 


POLYCYSTIC KIDNEY 
DISEASE 2; PKD2 




QDPR 


GDB:120331 


4pl5.3-4pl5.3 
4pl5.31-4pl5.31 


PHENYLKETONURIA II 


20 


SGCB 


GDB:702072 


4ql2-4ql2 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE TYPE2E- 
LGMD2E 


SLC25A4 


GDB: 119680 


4q35-4q35 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1; ANT1 

OPHTHALMOPLEGIA; PEO 


25 


SNCA 




4q21-4q21 


QVXTT T/^T "RTKT A T DU A • CTVTf^ A 

PARKINSON DISEASE, 
FAMILIAL, TYPE 1; PARK1 




SOD3 


GDB: 125291 


4pl6.3-4q21 


SUPEROXIDE DISMUTASE, 
EXTRACELLULAR; SOD3 




STATH 


GDB:120391 


4qll-4ql3 


STATHERIN; STATH; STR 


30 


TAPVR1 


GDB-392646 


4nl3-4all 


ANOMAT OIT«s PTTT MfYWAPV 

VENOUS RETURN; APVR 




TYS j 


GDB:1 19624 j 


4q-4q 


SCLEROTYLOSIS; TYS 




WBS2 


GDB: 132426 


4q33-4q35.1 


WILLIAMS-BEUREN 
SYNDROME; WBS 


35 


WFS1 


GDB:434294 


4p-4p 4pl6-4pl6 


DIABETES MELLITUS AND 
INSIPIDUS WITH OPTIC 
ATROPHY AND DEAFNESS 
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Gene 


GDB Accession ID 


Location 


OM1M Link 


WHCR 


GDB:125355 


4pl6.3-4pl6.3 


WOLF-HIRS CHHORN 
SYNDROME; WHS 



5 Table 6: Genes, Locations and Genetic Disorders on Chromosome 5 





Gene ] 


L»u J3 Accession iaj i 


OMIM Link _J 




A T*\ A TV XTCI 1 

ADAM 1 Sz 1 




EHLERS-DANLOS SYNDROME, 
TYPE VH, AUTOSOMAL RECESSIVE 


10 


ADRB2 | 


GDB:120541 1 


BETA-2-ADRENERGIC RECEPTOR; 
ADRB2 




AMCN 1 


GDB:9836823 I 
i I 


ARTHROGRYPOSIS MULTIPLEX I 
CONGENITA, NEUROGENIC TYPE j 


15 


AP3B1 I 


GDB:9955590 1 


HERMANSKY-PUDLAK 1 
SYNDROME; HPS 


APC | 


GDB: 119682 1 


ADENOMATOUS POLYPOSIS OF 1 
THE COLON; APC 




ARSB 1 


GDB:1 19008 1 


MUCOPOLYSACCHARIDOSIS TYPE I 
VI; MPS VI 1 




B4GALT7 


GDB-.9957653 


SYNDROME, PROGEROID FORM 


20 


BHR1 


GDB:9956078 


ASTHMA _j 




|C6 


GDB: 11 9045 


COMPLEMENT COMPONENT-6, 
DEFICIENCY OF 




|C7 


1 GDB: 119046 


COMPLEMbNl CUMrUlNtiiNi-/, I 
DEFICIENCY OF 


25 


|CCAL2 


1 GDB:5584265 


I CHONDROCALCINOSIS, FAMILIAL 1 
ARTICULAR 




1 CKN1 


1 GDB: 128586 


COCKAYNE SYNDROME, TYPE I; 1 
CKN1 1 


30 


1 CMDJ 


1 GDB:9595425 


CRANIOMETAPHYSEAL 1 
DYSPLASIA, JACKSON TYPE; CMDJ 




1 CRHBP 


j GDB: 127438 


CORTICOTROPIN RELEASING | 
HORMONE-BINDING PROTEIN; 

CRHBP 1 




1 CSF1R 


1 GDB: 120600 


COLONY-STIMULATING FACTOR-1 1 
RECEPTOR; CSF1R _| 



-70- 
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Gene 


GDB Accession ID 


OMIM Link 




DHFR 


GDB: 119845 


DIHYDROFOLATE REDUCTASE; 
DHFR 


5 


DIAPH1 


GDB:9835482 


DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 1; DFNA1 
DIAPHANOUS, DROSOPHILA, 
HOMOLOG OF. 1 




DTR 


GDB: 119853 


DIPHTHERIA TOXIN SENSITIVITY; 
DTS 


10 


EOS 


GDB:9956083 


EOSINOPHILIA, FAMILIAL 




ERVR 


GDB:9835857 


HYALOIDEORETINAL 
DEGENERATION OF WAGNER 




F12 


GDB: 11 9892 


HAGEMAN FACTOR DEFICIENCY 


15 


FBN2 


GDB: 128122 


CONTRACTURAL 
ARACHNODACTYLY, 
CONGENITAL; CCA 




GDNF 


GDB.450609 


GLIAL CELL LINE-DERIVED 
NEUROTROPHIC FACTOR; GDNF 


20 


GHR 


GDB:1 19984 


GROWTH HORMONE RECEPTOR; 
GHR 






V_JL/ 13 . 1 1 OOU 1 


GLYCINE RECEPTOR, ALPHA- 1 
SUBUNIT; GLRA1 KOK DISEASE 




GM2A 


GDB: 120000 


TAY-SACHS DISEASE, AB VARIANT 


25 




KJLJD . 1 1 7jwO 


SANDHOFF DISEASE 






17-@BETA-HYDROXYSTEROID 
DEHYDROGENASE IV; HSD17B4 




ITGA2 


GDB: 128031 


INTEGRIN, ALPHA-2; ITGA2 




KFS 


GDB:9958987 


VERTEBRAL FUSION 


30 


LGMD1A 


GDB:1 18832 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 1A; LGMD1A 




LOX 


GDB: 119367 


LYSYL OXIDASE; LOX 




LTC4S 


GDB.-384080 


LEUKOTRIENE C4 SYNTHASE; 
LTC4S 



35 
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£5 .072WS 



Gene 



MAN2A1 



GDB Accession ID 



GDB:136413 



OMIM Link 



MANNOSIDASE, ALPHA, II; MANA2 
DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE II 



MCC 



GDB:128163 



MUTATED IN COLORECTAL 
CANCERS; MCC 



MCCC2 



GDB:135990 



II 



MSH3 



GDB:641986 



MutS, E. COLI, HOMOLOG OF, 3; 
MSH3 



10 



MSX2 



GDB: 138766 



MSH (DROSOPHILA) HOMEO BOX 
HOMOLOG 2; MSX2 PARIETAL 
ORAMINA, SYMMETRIC; PFM 



i NR3C1 



PCSK1 



15 



PDE6A 



GDB:120017 



GDB: 128033 



GDB: 120265 



GLUCOCORTICOID RECEPTOR; GRL 



PROPROTEIN CONVERTASE 
SUBTILISIN/KEXIN TYPE 1; PCSK1 



PHOSPHODIESTERASE 6A, 
cGMP-SPECIFIC, ROD, ALPHA; 
PDE6A 



PFBI 



GDB:9956096 



INTENSITY OF INFECTION IN 



20 RASA1 



GDB:120339 



RAS p21 PROTEIN ACTIVATOR 1; 
RASA1 



SCZD1 



GDB: 120370 



DISORDER- 1; SCZD1 



SDHA 



25 



SGCD 



SLC22A5 



SLC26A2 



30 



GDB:378037 



GDB:5886421 



GDB:9863277 



GDB:125421 



SUCCINATE DEHYDROGENASE 
COMPLEX, SUBUNIT A, 
FLAVOPROTEIN; SDHA 



SARCOGLYCAN, DELTA; SGCD 



CARNITINE DEFICIENCY, 
SYSTEMIC, DUE TO DEFECT IN 
RENAL REABSORPTION 



DIASTROPHIC DYSPLASIA; DTD 
EPIPHYSEAL DYSPLASIA, 
MULTIPLE; MED NEONATAL 
OSSEOUS DYSPLASIA I 
ACHONDROGENESIS, TYPE IB; 
ACG1B . 



35 



72 
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Gene 


GDB Accession ED 


OMIM Link 




SLC6A3 


GDB: 132445 


SOLUTE CARRIER FAMILY 6, 
MEMBER 3; SLC6A3 
DEFICIT-HYPERACTTVITY 

rvro/'YD niii) . A r>T4T) 
UloLlKJL'JtlK, ALftUJ 


5 


SMI 


GDB:9834488 


SCHISTOSOMA MANSONI 
SUSCEPTIBILrrY/RESISTANCE 




SMA@ 


GDB: 120378 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMN1 


10 


SMN1 


GDB:5215173 


SPINAL MUSCULAR ATROPHY I; 

SMA 1 oUKVlv AL ur iviv_/ 1 wrv 

NEURON 1, TELOMERIC; SMN1 




SMN2 


GDB-.5215175 


SPINAL MUSCULAR ATROPHY I; 

oiViTA t CTFDX/n/AT C\T* "MOTOR 

NEURON 2, CENTROMERIC; SMN2 




SPINK5 


GDB:9956114 


NETHERTON DISEASE 


15 


TCOF1 


GDB.127390 


TREACHER 

COLLINS-FRANCESCHETTI 

QVMnRflMF 1 • TCOF1 


20 


TGFBI 


GDB:597601 


CORNEAL DYSTROPHY, 
GRANULAR TYPE CORNEAL 
DYSTROPHY, LATTICE TYPE I; 
CDL1 TRANSFORMING GROWTH 
FACTOR, BETA-INDUCED, 68 KD; 
TGFBI 




Table 7: Gen 


es, Locations and Genetic Disorders on Chromosome 6 


25 


Gene 


GDB Accession ID 


OMIM Link 




ALDH5A1 


GDB:454767 


SUCCINIC SEMIALDEHYDE 
DEHYDROGENASE, 
NAD(+>DEPENDENT; SSADH 




ARG1 


GDB: 119006 


ARGININEMIA 


30 


AS 


GDB: 135697 


ANKYLOSING SPONDYLITIS; AS 


ASSP2 


GDB:1 19017 


CITRULLINEMIA 




BCKDHB 


GDB: 118759 


MAPLE SYRUP URINE DISEASE, TYPE 
IB 


35 


BF 


GDB.l 19726 


GLYCINE-RICH 
BETA-GLYCOPROTEIN; GBG 
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333 





Gene 


GDB Accession ID 


OMIM Link 




C2 


GDB:119731 


COMPLEMENT COMPONENT-2, 
DEFICIENCY OF 


5 


C4A 


GDB: 119732 


COMPLEMENT COMPONENT 4A; C4A 




CDKN1A 


GDB.-266550 


CYCLES-DEPENDENT KINASE 
INHIBITOR 1A; CDKN1 A 




COL10A1 


GDB:128635 


COLLAGEN, TYPE X, ALPHA 1 ; 
COL10A1 


10 


COL11A2 


GDB: 119788 


COLLAGEN, TYPE XI, ALPHA-2; 
COL11A2 STICKLER SYNDROME, 
TYPE H; STL2 DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
13; DFNA13 


15 


CYP21A2 


GDB:120605 


ADRENAL HYPERPLASIA, 

CONGENITAL, DUE TO 

21 -HYDROXYLASE DEFICIENCY 






ODB-437584 


DYSLEXIA, SPECIFIC, 2; DYX2 




EJM1 


GDB.l 19864 


MYOCLONIC EPILEPSY, JUVENILE; i 
EJM1 


20 


ELOVL4 


GDB: 11499609 


STARGARDT DISEASE 3; STGD3 




EPM2A 


GDB:3763331 


EPILEPSY, PROGRESSIVE 
MYOCLONIC 2; EPM2 




ESR1 


GDB:1 19120 


ESTROGEN RECEPTOR; ESR 


25 


EYA4 


GDB:700062 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
10; DFNA10 




F13A1 


GDB: 1206 14 


FACTOR XIU, Al SUBUNIT; F13A1 




FANCE 


GDB: 1220236 


FANCONI ANEMIA, 
COMPLEMENTATION GROUP E; FACE 


30 


GCLC 


GDB:132915 


GAMMA-GLUT AMYLCYSTEINE 
SYNTHETASE DEFICIENCY, 
HEMOLYTIC ANEMIA DUE 




GJA1 


GDB.125196 


GAP JUNCTION PROTEIN, ALPHA- 1, 
43KD;GJA1 




GLYS1 


GDB-.136421 


GLYCOSURIA, RENAL 



74 



NY2.-132135S.1 





Gene I 


GDB Accession ID 


OMIM Link 1 




GMPR 1 


GDB:127058 


GUANINE MONOPHOSPHATE I 
REDUCTASE 


5 


GSE 1 


GDB:9956235 


r\tcc AQP* C^W 1 
JJlOCrAoCf, y-sLJ ^Jj 




HCR 1 


GDB:9993306 


PSORIASIS, SUSCEPTIBILITY TO 
uT?r?r^rM}-1 1 Q70Q S 

HEMOCHROMATOSIS; HFE 


10 


HLA-A | 


GDB:1 19310 


HLA-A HISTOCOMPATIBILITY TYPE; 
m aahta T1PTU fiTOV 1 20636 LA-DP 1 
HISTOCOMPATIBILITY TYPE, BETA-1 
SUBUNIT J 




HLA-DRA 1 


GDB:120641 


HLA-DR HISTOCOMPATIBILITY 1 
TYPE; HLA-DRA [ 


15 


HPFH 1 


GDB:9849006 


HETEROCELLULAR HEREDITARY 1 
PERSISTENCE OF FETAL 
HEMOGLOBIN _J 




ICS1 1 


GDB: 136433 


MMOTILE CILIA IS Y JN jJKUMJi- 1,1^01 | 




IDDM1 


GDB:9953173 


DIABETES MELLITUS, 
JUVENILE-ONSET | 
INSULIN-DEPENDENT; IDDM 


20 


IFNGR1 


1 GDB: 120688 


INTERFERON, GAMMA, RECEPTOR-1; 
IFNGR1 




IGAD1 


| GDB.6929077 


SELECTIVE DEFICIENCY OF I 




IGF2R 


1 GDB.120083 


INSULIN-LIKE GROWTH FACTOR 2 1 
RECEPTOR; IGF2R 1 


25 


ISCW 


1 GDB.9956158 


SUPPRESSION; IS j 




LAMA2 


| GDB: 132362 


LAMINEM, ALPHA 2; LAMA2 j 




LAP 


j GDB:9958992 


LARYNGEAL ADDUCTOR I 
PARALYSIS; LAP 1 




LCA5 


| GDB:1 1498764 


AMAUROSIS CONGENITA OF LEBER I 


30 


LPA 


1 GDB: 120699 


APOLIPOPROTEIN(a); LPA 




MCDR1 


1 GDB:131406 


MACULAR DYSTROPHY, RETINAL, 1, 1 
NORTH CAROLINA TYPE; MCDR1 




MOCS1 


1 GDB.9862235 


MOLYBDENUM COFACTOR 
DEFICIENCY 1 



35 
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GDB Accession ID loMIMLink 


MUT |« 


GDB: 120204 1 ] 


METH YLMALON IC ACIDU RI A DUE TO 
METHYLMALONIC CoA MUTASE 
DEFICIENCY J 


Imyb 1 


ODB'l 19441 


V-MYB AVIAN MYELOBLASTOSIS 1 
VIRAL ONCOGENE HOMOLOG; MYB 


NEU1 J 


GDR' 120230 1 


NEURAMINIDASE DEFICIENCY 1 


NKS1 




SUSCEPTIBILITY TO LYSIS BY 1 
ALLOREACTIVE NATURAL KILLER | 
CEI LS- EC1 1 


| NYS2 j 


GDB:9848763 I 


NYSTAGMUS, CONGENITAL 


1 ° A3 1 


GDB: 136429 1 


ALBINISM, OCULAR, AUTOSOMAL I 
RECESSIVE; OAR 


1 ODDD I 


GDB-.6392584 1 


OCULODENTODIGITAL DYSPLASIA; 
ODDD 1 


| OFC1 


GDd: 121/24/ 


OROFACIAL CLEFT 1 ; OFC1 


1 PARK2 1 


| GDB:6802742 J 


PARKINSONISM, JUVENILE 


1 PBCA I 


1 GDB:9956321 


BETA CELL AGENESIS WITH 
1 NEONATAL DIABETES MELLITUS 


I 

PBCRA1 


1 GDB:3763333 


1 CHORIORETINAL ATROPHY, | 
| PROGRESSIVE BIFOCAL; CRAPB 1 


IPDBI 


1 GDB: 136349 


1 DISEASE OF BONE; PDB 


I PEX3 


1 rrDR-9955507 


1 ZELLWEGER SYNDROME; ZS 


; IPEX6 

1 


1 rrnR*SS92414 


ZELLWEGER SYNDROME; ZS 
PEROXIN-6; PEX6 J 


|PEX7 


1 GDB:6155803 


RHIZOMELIC CHONDRODYSPLASIA 
1 PUNCTATA; RCDP PEROXIN-7; PEX7 J 


PKHD1 


1 GDB:433910 


POLYCYSTIC KIDNEY AND HEPATIC 
DISEASE- 1 ; PKHD 1 __J 


PLA2G7 


1 GDB:9958829 


PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, SUBUNIT 


1 PLG 


| GDB: 119498 


j PLASMINOGEN; PLG | 


POLH 


1 GDB:6963323 


PIGMENTOSUM WITH NORMAL DNA 
j REPAIR RATES 1 



35 
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Gene 


GDB Accession ID 


OMIM Link 




PPAC 


GDB:9956248 


ARTHROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID, OF 
CHILDHOOD 


5 ■ 


PSOPvSl 


GDB:6381310 


PSORIASIS, SUSCEPTIBILITY TO 




PUJO 


GDB:9956231 


MULTICYSTIC RENAL DYSPLASIA, 
BILATERAL; MRD 




RCD1 


GDB-.333929 


RETINAL CONE DEGENERATION 


10 


RDS 


GDB: 11 8863 


RETINAL DEGENERATION, SLOW; 
RDS 




RHAG 


GDB: 136011 


RHESUS BLOOD 
GROUP-ASSOCIATED 
GLYCOPROTEIN; RHAG RH-NULL, 
REGULATOR TYPE; RHN 


15 


RP14 


GDB:433713 


RETINITIS PIGMENTOSA-14; RP14 
TUBBY-LIKE PROTEIN 1 ; TULP1 




RUNX2 


GDB:392082 


CLEIDOCRANIAL DYSPLASIA; CCD 
CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNIT 1; CBFA1 


20 


K Wo 


frD'R-9956195 


SENSITIVITY 


or* A 1 


frDFM 19588 


SPINOCEREBELLAR ATAXIA 1; SCA1 






VJ VJ O * UJ J7H 


DISORDER-3; SCZD3 








SIALIC ACID STORAGE DISEASE; 
SIASD 


25 


SOD2 


GDB:1 19597 


SUPEROXIDE DISMUTASE 2, 
MITOCHONDRIAL; SOD2 




ST8 


GDB:61 18456 


OVARIAN TUMOR 




TAP1 


GDB:132668 


TRANSPORTER 1, ABC; TAP1 




TAP2 


GDB:132669 


TRANSPORTERS ABC; TAP2 


30 


TFAP2B 


GDB:681506 


DUCTUS ARTERIOSUS; PDA 
TRANSCRIPTION FACTOR AP-2 BETA; 
TFAP2B 




TNDM 


GDB.9956265 


DIABETES MELLITUS, TRANSIENT 
NEONATAL 


35 


TNF 


GDB:120441 


TUMOR NECROSIS FACTOR; TNF 
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Gene 


GDB Accession ID 


OMIM Link 






VJL/D.IZJJOO 


TROPHOBLAST GLYCOPROTEIN: 

IJroiJ, JViOJrl 


5 


TPMT 


GDB:209025 


THIOPURINE 

S-METHYLTRANSFERASE; TPMT 




TULP1 


GDB:61 99353 


TUBBY-LIKE PROTEIN 1 ; TULP1 


10 


WISP3 


GDB:9957361 


ARTHROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID, OF 
CHILDHOOD 








Table 8: Genes, Locations and Genetic Disorders on Chromosome 7 




Gene 


GDB Accession ID 


OMIM Link 




AASS 


GDB: 11502144 


HYPERLYSINEM1A 


15 


ABCB1 


GDB:120712 


P-GLYCOPROTEIN-1; PGY1 




ABCB4 


GDB:120713 


P-GLYCOPROTEIN-3; PGY3 




ACHE 


GDB:1 18746 


ACETYLCHOLINESTERASE BLOOD 
GROUP-Yt SYSTEM; YT 


20 


AQP1 


GDB:129082 


AQUAPORIN-1-, AQP1 BLOOD 
GROUP--COLTON; CO 




ASL 


GDB: 11 9703 


ARGININOSUCCINICACIDURIA 




ASNS 


GDB: 119706 


ASPARAGINE SYNTHETASE; ASNS; AS 




AUTS1 


GDB:9864226 


DISORDER 


25 


BPGM 


GDB: 119039 


DIPHOSPHOGLYCERATE MUTASE 
DEFICIENCY OF ERYTHROCYTE 




C7orf2 


GDB: 10794644 


ACHEIROPODY 


30 


CACNA2D1 


GDB: 132010 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, L TYPE, 
ALPHA-2/DELTA SUBUNIT; 
MALIGNANT HYPERTHERMIA 
SUSCEPTIBILITY-3 




CCM1 


GDB:580824 


CEREBRAL CAVERNOUS 
MALFORMATIONS 1; CCM1 




CD36 


GDB:138800 


CD36 ANTIGEN; CD36 



35 
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Gene 


GDB Accession ID 


OMEVf Link 




CFTR 


GDB: 120584 


CYSTIC FIBROSIS; CF DEFERENS, 
CONGENITAL BILATERAL APLASIA 
OF; CBAVD; CAVD 


5 


CHORDOMA 


GDB: 11498328 






CLCN1 


GDB: 134688 


CHLORIDE CHANNEL 1, SKELETAL 
MUSCLE; CLCN1 


10 


CMH6 


GDB:9956392 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, WITH j 
WOLFF-P ARK1NSON- WHI 1 b 




CMT2D 


GDB:9953232 


CHARCOT-MAKib- 1 U(J 1 rl LHbiiAoti, 
NEURONAL TYPE, D 


15 


COL1A2 


GDB:1 19062 


COLLAGEN, TYPE I, ALPHA-2 
POLYPEPTIDE; COL1 A2 
OSTEOGENESIS IMPERFECTA TYPE I 
OSTEOGENESIS IMPERFECTA TYPE 
IV; OI4 




CRS 


GDB:1 19073 


CRANIOSYNOSTOS1S, TYPE 1; CRS1 




CYMD 


GDB:366594 


MACULAR EDEMA CYSTOID 


20 


DFNA5 


GDB:636174 


DEAFNEoO, AU 1 UallMAL UUMJiNAiN l 
NONSYNDROMIC SENSORINEURAL, 5; 
DFNA5 




DLD 


GDB: 120608 


LIPOAMIDE DEHYDROGbNAbri 
DEFICIENCY, LACTIC ACIDOSIS DUE 
TO 


25 


DYT11 


GDB-.10013754 


MYOCLONUS, HEREDITARY 
ESSENTIAL 




EEC1 


GDB: 13633 8 


ECTRODACTYLY, ECTODERMAL 
DYSPLASIA, AND CLEFT LIP/PALATE; 
EEC 


30 


T"»T *KT 

ELN 


/*»TVD . 1 1 A1 A*7 

GDB:l lyiuy 


T3T A CTTKT« T7T XT W7TT T T A \/fQ_RpT TPT? "Nf 

SYNDROME; WBS 


ETV1 


GDB:335229 


ETS VARIANT GENE 1; ETV1 




FKBP6 


GDB:99552l5 


WILLIAMS -BEUREN SYNDROME; WBS 


35 


GCK 


GDB:127550 


DIABETES MELLITUS, AUTOSOMAL 
DOMINANT, TYPE II GLUCOKINASE; 
GCK 
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Gene 


GDB Accession ID 


OMIMLink 1 




GHRHR 1 


GDB: 138465 1 


GROWTH HORMONE-RELEASING 1 
HnPMOisTP RFPFPTOR: GHRHR 1 


5 


GHS 1 


GDB:9956363 


MICROSOMIA WITH RADIAL DEFECTS 1 




GLI3 1 


GDB: 119990 1 


PALLISTER-HALL SYNDROME; PHS 
ot i VPT TPPFT F AMTT Y MEMBER 3 - 1 
GLI3 POSTAXIAL POLYDACTYLY, 
TYPE Al GREIG 
CEPHALOPOLYSYNDACTYLY 
SYNDROME; GCPS J 


10 [ 


GPDS1 1 


GDB:9956410 j 


GLAUCOMA, PIGMENT-DISPERSION 
TYPE j 




GUSB J 


GDB: 120025 J 


\ ji TfTSPOT Y<sAPPHARIDOSIS TYPE VII 1 


15 ( 


HADH 


GDB:120033 


DEHYDROGENASE/3-KETOACYL-CoA 
THIOLASE/ENO YL-Co A HYDRATASE, 




HLXB9 1 


,GDB:l3641l 


. UOA/ICO ROY fiPNP TTRQ- HLXB9 1 
SACRAL AGENESIS, HEREDITARY, 
WITH PRESACRAL MAbb, AN 1 fc-KlUK 
MENINGOCELE, — 1 




| HOXA13 


| GDB:120656 


| HOMEO BOX Al 3; HOXA1 3 | 


20 


1 HPFH2 


I GDB:l2807l 


1 HEREDITARY PERSIS 1 bNCfc, Ur \ 

FETAL HEMOGLOBIN, 
1 HETEROCELLULAR, INDIAN 




| HRX 


| GDB:9958999 


HRX 


25 


1 LAB 


I GDB: 1 1498909 


| ANEURYSM, INTRACRANIAL BERRY 1 


1 IMMP2L 


1 GDB: 11499195 


1 GILLES DE LA TOURETTE 1 
1 SYNDROME; GTS 




J KCNH2 


| GDB:138126 


1 LONG QT SYNDROME, TYPE 2; LQT2 1 




| LAMB1 


| GDB: 119357 


| LAMESFIN BETA 1; LAMB1 j 


30 


1 LEP 


| GDB: 136420 


1 T T?PTTM« T P?P I 
| -L»E/Jr 1 UN ) LEr | 




I MET 


| GDB: 1201 78 


| MET PROTO-ONCOGENE; MET 1 




Incfi 


1 GDB:120222 


GRANULOMATOUS DISEASE, 1 
CHRONIC, AUTOSOMAL 
| CYTOCHROME-b-POSIlWE FORM 


35 


Inm 


1 GDB: 119454 


1 NEUTROPHIL CHEMOTACTIC j 
j RESPONSE; NCR | 
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Gene 


GDB Accession ID 


OMEVf Link 




OGDH 


GDB: 118847 


ALPHA-KETOGLUTARATE 
DEHYDROGENASE DEFICIENCY 


5 


OPN1SW 


GDB: 11 9032 


TRITANOPIA 




PEX1 


GDB.-9787110 


ZELLWEGER SYNDROME; ZS 
PEROXIN-l;PEXl 




PGAM2 


GDB- 120280 


a rlvJor'rlwVjL i i^JtSiv/Y 1 fc* JV1U 1 Aob, 
DEFICIENCY OF M SUBUNTT OF 


10 


PMS2 


GDB'386406 


INCREASED (S. CEREVISIAE)-2; PMS2 




PON1 


GDB: 120308 


PARAOXONASE 1; PON1 


15 


PPP1R3A 


GDB -136797 


REGULATORY (INHIBITOR) SUBUNTT 
3;PPP1R3 




PR SSI 


LrL/o. X I yo^u 


PANCREATITIS, HEREDITARY; PCTT 
PROTEASE, SERINE, 1; PRSS1 | 




PTC 

-IT L V- 


\QHAA 


PHENYLTHIOCARBAMIDE TASTING 


20 


PTPN12 


GDB: 136846 


PROTEIN-TYROSINE PHOSPHATASE, 
NONRECEPTOR TYPE, 12; PTPN 12 




RP10 


GDB: 138786 


RETINITIS PIGMENTOSA-10; RP10 




RP9 


GDB:333931 


RETINITIS PIGMENTOSA-9; RP9 




SERPINE1 


GDB: 120297 


PLASMINOGEN ACTIVATOR 

TNTT-imfTi^TD TViyC T. n A Tl 

iiNrutJi l \jk, i YPJb 1; rAll 


25 


SGCE 


GDB.-9958714 


MYOCLONUS, HEREDITARY 
ESSENTIAL 




SHFM1 


GDB: 128 195 


SPLIT-HAND/FOOT DEFORMITY, TYPE 
I; SHFD1 


30 


SHH 


GDB:456309 j 


HOLOPROSENCEPHALY, TYPE 3; HPE3 
SONIC HEDGEHOG, DROSOPHILA, 
HOMOLOG OF; SHH 




SLC26A3 


GDB: 138165 


DOWN-REGULATED IN ADENOMA; 
DRA CHLORIDE DIARRHEA, 
FAMILIAL; CLD 


35 


SLC26A4 


GDB.-5584511 


PENDRED SYNDROME; PDS 
DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 4; DFNB4 
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GDB Accession ID 


OMIM Link 




SLOS 


GDB:385950 


SMITH-LEMLI-OPITZ SYNDROME 




SMAD1 


GDB:3763345 


SPINAL MUSCULAR ATROPHY, 
DISTAL, WITH UPPER LIMB 
PREDOMINANCE; SMAD1 




TBXAS1 


GDB: 128744 


THROMBOXANE A SYNTHASE 1; 
TBXAS1 


10 


TWIST 


GDB: 135694 


ACROCEPHALOSYNDACTYLY TYPE 
HI TWIST, DROSOPHILA, HOMOLOG 
OF; TWIST 




ZWS1 


GDB:120511 


ZELLWEGER SYNDROME; ZS 




fable 9: Genes, Locations and Genetic Disorders on Chromosome 8 


15 


Gene 


GDB AccessionID 


OMIM Link 


ACHM3 


GDB:9120558 


PINGELAPESE BLINDNESS 




ADRB3 


GDB:203869 


BETA-3-ADRENijK*jIC KJDV_-iir l UK, aurd j 




ANK1 , 


GDB:118737 


SPHEROCYTOSIS, HEREDITARY; HS 


20 


CA1 


GDB:1 19047 


CARBONIC ANHYDRASE I, ERYTHROCYTE, 
ELECTROPHORETIC VARIANTS OF; CA1 




CA2 


GDB: 119739 


OSTEOPETROSIS WITH RENAL TUBULAR 
ACIDOSIS 




CCAL1 


GDB:5 12892 


CHONDROCALCINOSIS WITH 
EARLY-ONSET OSTEOARTHRITIS; CCAL2 


25 


CLN8 


GDB:252118 


EPILEPSY, PROGRESSIVE, WITH MENTAL 
RETARDATION; EPMR 




CMT4A 


GDB:138755 


CHARCOT-MARIE-TOOTH NEUROPATHY 
4A; CMT4A 




CNGB3 


GDB:9993286 


PINGELAPESE BLINDNESS 


30 


COH1 


GDB-.252122 


COHEN SYNDROME; COH1 


CPP 


GDB:1 19798 


CERULOPLASMIN; CP 




CRH 


GDB:1 19804 


CORTICOTROPIN-RELEASING HORMONE; 
CRH 


35 


CYP11B1 


GDB:120603 


ADRENAL HYPERPLASIA, CONGENITAL, 
DUE TO 1 1 -@BETA-H YDROXYLASE 
DEFICIENCY 
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GDB AccessionlD 


OMIM Link 


CYP11B2 

V/ All M. 


GDB:120514 


CYTOCHROME P450, SUBFAMILY XIB, 
POLYPEPTIDE 2; CYP1 1B2 


DECR1 


GDB:453934 


2,4-@DIENOYL-CoA REDUCTASE; DECR 


DPYS 


GDB:5885803 

S*** * <+S %J %JF \J 


DIHYDROPYRIMIDINASE; DPYS 


DURS1 


GDB.9958126 


DUANE SYNDROME 


EBS1 


GDB: 119856 


EPIDERMOLYSIS BULLOSA SIMPLEX, 
OGNA TYPE 


ECA1 


GDB: 107963 18 


JUVENILE ABSENCE 


EGI 


GDB.128830 


EPILEPSY, GENERALIZED, IDIOPATHIC; 
EGI 


EXTl 


GDB-.135994 


EXOSTOSES, MULTIPLE, TYPE I; EXTl 
CHONDROSARCOMA 


EYA1 


GDB:5215167 


BRANCHIOOTORENAL DYSPLASIA EYES 
ABSENT 1; EYA1 


FGFR1 


GDB: 119913 


ACROCEPHALOSYNDACTYLY TYPE V 
FIBROBLAST GROWTH FACTOR 
RECEPTOR-1; FGFR1 


GNRH1 


GDB: 133746 


GONADOTROPIN-RELEASING HORMONE 1; 
GNRH1 FAMILIAL HYPOGONADOTROPHIC 


GSR 


GDB: 119288 


GLUTATHIONE REDUCTASE; GSR 


GULOP 


GDB: 128078 


SCURVY 


HR 


GDB-.595499 


ALOPECIA UNIVERSALIS ATRICHIA WITH 
PAPULAR LESIONS HAIRLESS, MOUSE, 
HOMOLOG OF 


KCNQ3 


GDB:9787230 


CONVULSIONS, BENIGN FAMILIAL 
NEONATAL, TYPE 2; BFNC2 POTASSRJM 
CHANNEL, VOLTAGE-GATED, SUBFAMILY 
Q, MEMBER 3 


KFM 


GDB:265291 


KLIPPEL-FEIL SYNDROME; KFS ; KFM 


KWE 


GDB:9315120 


KERATOLYTIC WINTER ERYTHEMA ! 


LGCR 


GDB:120698 


LANGER-GIEDION SYNDROME; LGS 


LPL 


GDB: 120700 


HYPERLIPOPROTEINEMIA, TYPE I 


MCPH1 


GDB:9834525 


MICROCEPHALY; MCT 
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MOS 


GDB 1 19396 


TRANSFORMATION GENE: ONCOGENE 
MOS i 


MYC 


GDB: 120208 


TRANSFORMATION GENE: ONCOGENE 
MYC; MYC 


NAT1 


GDB- 125364 


ARYL AMIDE ACETYLASE 1; AAC1 


NAT2 


GDB: 125365 


ISONIAZID INACTTVATION 


NBS1 


GDB:9598211 


NIJMEGEN BREAKAGE SYNDROME 


PLAT 


GDB: 119496 


PLASMINOGEN ACTIVATOR, TISSUE; PLAT 


PLEC1 


GDB:41 19073 


EPIDERMOLYSIS BULLOSA SIMPLEX AND 
LIMB-GIRDLE MUSCULAR DYSTROPHY 
PLECTIN 1;PLEC1 


PRKDC 


GDB.234702 


SEVERE COMBINED IMMUNODEFICIENCY 
DISEASE-1 ; SCID1 PROTEIN KINASE, 
DNA-ACTIVATED, CATALYTIC SUBUNIT; 
PRKDC 


PXMP3 


GDB* 131487 


PEROXIN-2; PEX2 ZELLWEGER 
SYNDROME; ZS 


RP1 


GDB- 120352 


RETINITIS PIGMENTOSA-1; RP1 


SCZD6 


GDB:9864736 


DISORDER-2; SCZD2 


SFTPC 


GDB: 120373 


PULMONARY SURFACTANT APOPROTEIN 
PSP-C 


SGM1 


GDB: 135350 


KLIPPEL-FEIL SYNDROME; KFS; KFM 


SPG5A 


GDB:250332 


SPASTIC PARAPLEGIA-5A, AUTOSOMAL 
RECESSIVE; SPG5A 


STAR 


GDB:635457 


STEROIDOGENIC ACUTE REGULATORY 
PROTEIN; STAR 


TG 


GDB: 120434 


THYROGLOBULIN; TG 


TRPS1 


GDB:594960 


TRICHORHINOPHALANGEAL SYNDROME, 
TYPEI;TRPS1 | 


TTPA 


GDB:5 12364 


VITAMIN E, FAMILIAL ISOLATED 
DEFICIENCY OF; VED TOCOPHEROL 
(ALPHA) TRANSFER PROTEIN; TTPA 


VMD1 


GDB: 119631 


MACULAR DYSTROPHY, ATYPICAL 
VITELLIFORM; VMD1 
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WRN 


GDB: 128446 


WERNER SYNDROME; WRN 



Table 10: Genes, Locations and Genetic Disorders on Chromosome 



1 


Gene 


GDB AccessionID | 


OMIM Link I 




ABCA1 


GDB.305294 I 


ANALPHALIPOPROTEINEMIA 
ATP -BINDING CASSETTE 1 ; ABCl 


10 


ABL1 


GDB: 119640 J 


ABELSON MURINE LEUKEMIA VIRAL 1 
ONCOGENE HOMOLOG 1 ; ABL1 




ABO 


GDB:1 18956 j 


ABO BLOOD GROUP; ABO 




ADAMTS13 


GDB.9956467 


THROMBOCYTOPENIC PURPURA 




AK1 


GDB: 119664 [ 


ADENYLATE KINASE- 1; AK1 1 


1 ^ 1 
1 


ALAD 


GDB: 119665 


DELTA-AMINOLEVULINATE 
DEHYDRATASE; ALAD _J 




ALDH1A1 


GDB: 119667 | 


ALDEHYDE DEHYDROGENASE- 1 ; ALDH1 1 




ALDOB 


GDB: 119669 ] 


FRUCTOSE B^TOLERANCE, HEREDITARY I 




AMBP 


GDB: 120696 


PROTEIN HC; HCP 


20 


AMCD1 


GDB:437519 


ARTHROGRYPOSIS MULTIPLEX 
1 CONGENITA, DISTAL, TYPE 1 , AMUL> 1 | 




| ASS 


GDB:1 19010 


| CITRULLINEMIA I 




1 BDMF 


GDB-.9954424 


BONE DYSPLASIA WITH MEDULLARY 1 
| FIBROSARCOMA 1 


25 


1 BSCL 


GDB:9957720 


1 CT7TP QVKTTTR OTV/TP 1 
1 olLIx O X IN xJ XV w l vi l^i i 




|C5 


GDB: 119734 


COMPLEMENT COMPONENT-5, 
1 DEFICIENCY OF 


30 


1 CDKN2A 


GDB.335362 


MELANOMA, CUTANEOUS MALIGNANT, 2; 
CMM2 CYCLIN-DEPENDENT KINASE 
1 INHIBITOR 2A; CDKN2A 1 




| CHAC 


GDB:6268491 


j CHOREOACANTHOCYTOSIS; CHAC _J 




| CHH 


GDB:138268 


1 CARTILAGE-HAIR HYPOPLASIA; CHH ] 




| CMD1B 


GDB:677147 


1 CARDIOMYOPATHY, DILATED IB; CMD1B 


35 


1 COL5A1 


GDB: 13 1457 


COLLAGEN, TYPE V, ALPHA- 1 
1 POLYPEPTIDE; COL5A1 | 
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Gene 



CRAT 
DBH 

DFNB11 



DFNB7 



GDB AccessionID 



OMIM Link 



GDB:359759 
GDB: 119836 

GDB: 1220 180 



CARNITINE ACETYLTRANSFERASE; CRAT 

DOPAMINE BETA-HYDROXYLASE, 
PLASMA; DBH 

DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 7; DFNB7 



GDB:636178 



DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 7; DFNB7 



10 I DNAU 



GDB: 11500297 



IMMOTILE CILIA SYNDROME- 1; ICS 1 



DYS 



GDB: 137085 



DYSAUTONOMIA FAMILIAL; DYS 



DYT1 



GDB: 119854 



DYSTONIA 1, TORSION; DYT1 



15 



EPB72 



GDB:137193 
GDB: 128993 



ENDOGLIN; ENG 

ERYTHROCYTE SURFACE PROTEIN BAND 
7.2; EPB72 STOMATOCYTOS1S I 



FANCC 



FBP1 

20 I FCMD 

FRDA 
GALT 



25 



GLDC 



GNE 



GDB:132672 



FANCONI ANEMIA, COMPLEMENTATION 
GROUP C; FACC 



GDB:141539 
GDB:250412 

GDB:1 19951 
GDB: 119971 



FRU CTOSE- 1 ,6-BISPHOPHATASE 1;FBP1 

FUKUYAMA-TYPE CONGENITAL 
MUSCULAR DYSTROPHY; FCMD 

FR IEDREICH ATAX IA 1; FRDA1 

GALACTOSEMIA 



GDB: 128611 



GDB:9954891 



HYPERGLYCINEMIA, ISOLATED 
NON KETOTIC, TYPE I; NKH1 

INCLUSION BODY MYOPATHY; IBM2 



GSM1 
GSN 



30 



HSN1 



IBM2 



35 



LALL 



GDB-.9784210 
GDB: 12001 9 



GENIOSPASM 1; GSM1 
AMYLOIDOSIS V GELSOLIN; GSN 



HSD17B3 GDB:347487 



GDB:3853677 



GDB:3801447 



PSEUDOHERMAPHRODITISM, MALE, WITH 
GYN ECOMASTIA 

NEUROPATHY, HEREDITARY SENSORY, 
TYPE 1 

INCLUSION BODY MYOPATHY; IBM2 



GDB:9954426 



LEUKEMIA, ACUTE, WITH 
LYMPHOMATOUS FEATURES; LALL 
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Gene 


GDB AccessionlD 


OMIM Link j 




LCCS 


GDB.-386141 


LETHAL CONGENITAL CONTRACTURE I 
SYNDROME; LCCS 


5 


LGMD2H 


GDB:9862233 


DYSTROPHY, HUTTERITE TYPE | 




LMX1B 


GDB:9834526 1 


NAIL-PATELLA SYNDROME; NPS1 | 




MLLT3 


GDB-.138172 1 


MYELOID/LYMPHOID OR MIXED LINEAGE 1 
LEUKEMIA, TRANSLOCATED TO, 3 ; MLLT3 




MROS 


GDB:9954430 | 


MELKERS SON SYNDROME 


10 | 


MSSE 


GDB: 128019 | 


EPITHELIOMA, SELF-HEALING SQUAMOUS 1 




NOTCHl 


GDB: 13 1400 1 


NOTCH, DROSOPfflLA, HOMOLOG OF, 1 ; 
NOTCHl 




ORM1 


GDB: 120250 J 


OROSOMUCOID 1; OKMi J 


15 


PAPPA 


GDB: 134729 1 


PREGNANCY-ASSOCIATED PLASMA 1 
PROTEIN A; PAPPA J 




PIP5K1B 


GDB:686238 


FRIEDREICH A 1 AX1A 1 , r K-l>/\ l 1 




PTCH 


GDB: 119447 


vt a c% a x /tt r xrc\/T TO CVA.rn\T? OA/TP* IVNTS 1 

1 BASAL CELL NEV U5> o Y iNLMtuivin,, d^ino i 
PATCHED, DROSOPHILA HOMOLOG OF; 
PTCH 


20 


1 PTGS1 


GDB: 128070 


PROSTAGLANDIN-ENDOPEROXIDASE 1 
| SYNTHASE 1; PTGS1 1 




| RLN1 


GDB: 119552 


RELAXIN; RLN1 _j 




|RLN2 


GDB: 119553 


RELAXIN, OVARIAN, OF PREGNANCY j 


25 


1 RMRP 


GDB: 120348 


MITOCHONDRIAL RNA-PROCESSING 1 
ENDORIBONUCLEASE, RNA COMPONENT 

HYPOPLASIA; CHH j 


30 


IROR2 


GDB:136454 


BRACHYDACTYLY, TYPE B; BDB 
TCDRTNOW SYNDROME RECESSIVE FORM 
NEUROTROPHIC TYROSINE KINASE, 
RECEPTOR-RELATED 2; NTRKR2 f 


1 RPD1 


GDB.-9954440 


I RETINITIS PIGMENTOSA-DEAFNESS 1 
J SYNDROME 1 , AUTOSOMAL DOMINANT j 




| SARDH 


GDB:9835149 


1 SARCOSINEMIA j 




|tdfa 


GDB:9954420 


| FACTOR, AUTOSOMAL __J 



35 



r 
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5 



Gene 


GDB Accession!!) 


OMBM Link 


TEK 


GDB:344185 


VENOUS MALFORMATIONS, MULTIPLE 
CUTANEOUS AND MUCOSAL; VMCM TEK 
TYROSINE KINASE, ENDOTHELIAL; TEK 


TSC1 


GDB:120735 


TUBEROUS SCLEROSIS-1; TSC1 


TYRP1 


GDB: 126337 


TYROSINASE-RELATED PROTEIN 1; TYRP1 
ALBINISM EI XANTHISM 


XPA 


GDB:125363 


XERODERMA PIGMENTOSUM I 



Table 1 1 : Genes, Locations and Genetic Disorders on Chromosome 10 



15 



25 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


CACNB2 


GDB: 132014 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, BETA-2 
SUBUNIT; CACNB2 


COL17A1 


GDB: 13 1396 


COLLAGEN, TYPE XVII, ALPHA-1 
POLYPEPTIDE; COL17A1 


CUBN 


GDB:636049 


MEGALOBLASTIC ANEMIA 1; MGA1 


CYP17 


GDB: 119829 


ADRENAL HYPERPLASIA, 
CONGENITAL, DUE TO 
1 7-ALPHA-HYDROXYLASE 
DEFICIENCY 


CYP2C19 


GDB: 119831 


CYTOCHROME P450, SUBFAMILY 
IIC, POLYPEPTIDE 19; CYP2C19 


CYP2C9 


GDB: 13 1455 


CYTOCHROME P450, SUBFAMILY 
nC, POLYPEPTIDE 9; CYP2C9 


EGR2 


GDB:120611 


EARLY GROWTH RESPONSE-2; 
EGR2 


EMX2 


GDB:277886 


EMPTY SPIRACLES, DROSOPHILA, 
2, HOMOLOG OF; EMX2 


EPT 


GDB:97861 12 


EPILEPSY, PARTIAL; EPT 


ERCC6 


GDB:1 19882 


EXCISION-REPAIR 
CROSS-COMPLEMENTING RODENT 
REPAIR DEFICIENCY, 
COMPLEMENTATION 


FGFR2 


GDB: 127273 


ACROCEPHALOSYNDACTYLY 
TYPE V FIBROBLAST GROWTH 
FACTOR RECEPTOR-2; FGFR2 
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GDB Accession ID 


— ■ 

OMEYI Link 


HK1 


GDB: 120044 


HEXOKINASE-l;HKl 


HOXll 


GDB: 119607 


HOMEO BOX-1 1; HOX1 1 


HPS 


GDB: 127359 


HERMANSKY-PUDLAK 
SYNDROME; HPS 


IL2RA 


GDB: 11 9345 


INTERLEUKIN -2 RECEPTOR, 
ALPHA; IL2RA 


LGI1 


GDB:9864936 


EPILEPSY, PARTIAL; EPT 


LIPA 


GDB:120153 


WOLMAN DISEASE 


MAT1A 


GDB: 129077 


METHIONINE 

ADENOSYLTRANSFERASE 

DEFICIENCY 


MBL2 


GDB-.120167 


MANNOSE-BINDING PROTEIN, 
SERUM; MBP1 


MKI67 


GDB:120185 


PROLIFERATION-RELATED Ki-67 
ANTIGEN; MKI67 


MXI1 


GDB-.137182 


MAX INTERACTING PROTEIN 1 ; 
MXI1 


OAT 


GDB-.120246 


ORNITHINE AMINOTRANSFERASE 
DEFICIENCY 


OATL3 


GDB:2 15803 


ORNITHINE AMINOTRANSFERASE 
DEFICIENCY 


PAX2 


GDB:138771 


PAIRED BOX HOMEOTIC GENE 2; 
PAX2 


PCBD 


GDB:138478 


PTERIN-4-ALPHA-CARBINOLAMINE 
DEHYDRATASE; PCBD 
PRIMAPTERINURIA 


PEOl 


GDB:632784 


PROGRESSIVE EXTERNAL 
OPHTHALMOPLEGIA; PEO j 


PHYH 


GDB:9263423 


REFSUM DISEASE 
PHYTANOYL-CoA HYDROXYLASE; 
PHYH 


PNLIP 


GDB: 127916 


LIPASE, CONGENITAL ABSENCE OF 
PANCREATIC 


PSAP 


GDB: 120366 


PROSAPOSIN; PSAP 
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Gene 


GDB Accession ID 


OMIM Link 


PTEN 


GDB:6022948 


MACROCEPHALY, MULTIPLE 
LIPOMAS AND HEMANGIOMATA 
MULTIPLE HAMARTOMA 
SYNDROME; MHAM POLYPOSIS, 
JUVENILE INTESTINAL 
PHOSPHATASE AND TENSIN 
HOMOLOG; PTEN 


RBP4 


GDB: 120342 


RETINOL-BINDING PROTEIN, 
PLASMA; RBP4 


RDPA 


GDB:9954445 


REFSUM DISEASE WITH 
INCREASED PJPECOLICACIDEMIA; 
RDPA 


RET 


GDB-.120346 


RET PROTO-ONCOGENE; RET 


SDF1 


GDB:433267 


STROMAL CELL-DERIVED FACTOR 
1; SDF1 


SFTPA1 


GDB: 119593 


PULMONARY SURFACTANT 
APOPROTEIN PSP-A; PSAP 


SFTPD 


GDB: 132674 


PULMONARY SURFACTANT 
APOPROTEIN PSP-D; PSP-D 


SHFM3 


GDB:386030 


SPLIT-HAND/FOOT 
MALFORMATION, TYPE 3; SHFM3 


SIAL 


GDB.6549924 


NEURAMINIDASE DEFICIENCY 


THC2 


GDB:10794765 


THROMBOCYTOPENIA 


TNFRSF6 


GDB:132671 


APOPTOSIS ANTIGEN 1; APT1 


UFS 


GDB:6380714 


UROFACIAL SYNDROME; UFS 


UROS 


GDB:128112 


PORPHYRIA, CONGENITAL 
ERYTHROPOIETIC; CEP 



10 



15 



20 



25 



Table 12: 



Genes, Locations and Genetic Disorders on Chromosome 1 1 



Gene 


GDB Accession ID 


OMIM Link 


AA 


GDB:568984 


ATROPHIA AREATA; AA 


ABCC8 


GDB:591370 


SULFONYLUREA RECEPTOR; SUR 
PERSISTENT HYPERINSULINEMIC 
HYPOGLYCEMIA OF INFANCY 


AC ATI 


GDB: 126861 


ALPHA-METHYLACETOACETIC ACIDURIA 


ALX4 


GDB: 10450304 


PARIETAL FORAMINA, SYMMETRIC; PFM 



30 



35 
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10 



15 



1 Gene 


GDB Accession ID | < 


OMIM Link 


1 AMPD3 


GDB.136013 I- 


ADENOSINE MONOPHOSPHAlfc 
DEAMINASE-3; AMPD3 


ANC 


GDB.9954484 


CANAL CARCINOMA 


APOA1 


GDB:1 19684 1 


AMYLOIDOSIS, FAMILIAL VISCERAL 
APOLIPOPROTEIN A-I OF HIGH DENSITY | 
LIPOPROTEIN; APOA1 


1 APOA4 


GDB:1 19000 | 


APOLIPOPROTEIN A-IV; APOA4 J 


1 APOC3 


GDB: 119001 \ 


APOLIPOPROTEIN C-HT, APOC3 


1 ATM 


GDB:593364 [ 


ATAXIA-TELANGIECTASIA; AT 


1 BSCL2 


GDB:9963996 | 


SEIP SYNDROME 1 


Ibws 


GDB:120567 1 


BECKWITH- WIEDEM AN N SYNDKUMfc., 
BWS 


I CALCA 


GDB: 120571 1 


C ALCITONIN/C ALCITOMN -RELA 1 bU 
POLYPEPTIDE, ALPHA; CALCA 


CAT 


GDB:1 19049 


CATALASE; CAT 


IcCNDl 


GDB: 128222 


LEUKEMIA, CHRONIC LYMPHATIC; CLL 1 
CYCLIN Dl ; CCND 1 


ICD3E 


GDB: 119764 


CD3E ANTIGEN, EPSILON POLYPEPTIDE; 1 
CD3E 


1 CD3G 


GDB:1 19765 


T3 T-CELL ANTIGEN, GAMMA CHAIN; 
T3G; CD3G 1 


|CD59 


GDB:1 19769 


CD59 ANTIGEN P 1 8-20; CD59 HUMAN 
1 LEUKOCYTE ANTIGEN MIC1 1 ; MIC1 1 J 


1 CDKN1C 


GDB:593296 


CYCLIN-DEPENDENT KINASE INHIBITOR 
1C; CDKN1C 


CLN2 


VjJJJd. IZDZZo 


CEROID-LIPOFUSCINOSIS, NEURONAL 2, J 
LATE INFANTILE TYPE; CLN2 


3 ICNTF 


GDB:125919 


1 CILIARY NEUROTROPHIC FACTOR; CNTF 1 


1 CPT1A 


GDB.597642 


HYPOGLYCEMIA, HYPOKETOTIC, WITH 1 
DEFICIENCY OF CARNITINE 
PALMITOYLTRANSFERASE CARNITINE 
PALMITOYLTRANSFERASE I, LIVER; j 
CPT1A 1 



35 
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32 







\JXAD rVvlrCSalUlI xxj 


OMIM Link 


5 


^ 1 D*-* 


GDTV642214 


KERATOSIS PALMOPLANTARIS WITH 
PERIODONTOPATHIA KERATOSIS 

OAT TVyTfYDT A"NTTAT?TG \a7TTT-J 

PERIODONTOPATHIA AND 
ONYCHOGRYPOSIS CATHEPSIN C; CTSC 




DDBl 


GDB:595014 


DNA DAMAGE-BINDING PROTEIN; DDBI 




DDB2 


GDB:595015 


DNA DAMAGE-BINDING PROTEIN-2; DDB2 


10 


DHCR7 


GDB.9835302 


SMITH-LEMLI-OPITZ SYNDROME 


DLAT 


GDB:1 18785 


CIRRHOSIS, PRIMARY; PBC 




DRD4 


GDB:127782 


DOPAMINE RECEPTOR D4; DRD4 




ECB2 


GDB:9958955 


POLYCYTHEMIA, BENIGN FAMILIAL 






GDB-9837373 


DYSPLASIA MARGARITA TYPE 


15 


FVR1 

X-r V XV 1 


GDR- 114029 


EXUDATIVE VITI^ORETTNOPATHY, 
FAMILIAL; EVR 

EXT2GDB:344921EXOSTOSES, MULTIPLE, 

TVPT7 TT« T7YT) OT-TOXTTM? HQ A PPHA/f A 




F2 




C*f\ A m TT A TTOXT TT A PTHD TT» T70 


20 


FSHB 




T?r\T T T/^T XJ CTTNyfT TT ATTMfi UnD A/f fYMT? 
r LlJLLflLyJLli-o 1 lIVlUJbA. 1 UN vjr Jrl\Jl\iVlvJlN n, 

BETA POLYPEPTIDE; FSHB 




FTH1 


GDB:120617 


FERRITIN HEAVY CHAIN 1; FTH1 




GIF 


GDB: 11 8800 


PERNICIOUS ANEMIA, CONGENITAL, DUE 
TO DEFECT OF INTRINSIC FACTOR 


25 


GSD1B 


GDB:9837619 


GLYCOGEN STORAGE DISEASE lb 




GSD1C 


GDB:9837637 


STORAGE DISEASE Ic 




HBB 


GDB: 119297 


HEMOGLOBIN-BETA LOCUS; HBB 




HBBP1 


GDB:120035 


HEMOGLOBIN-BETA LOCUS; HBB 


30 


HBD 


GDB: 11 9298 


HEMOGLOBIN—DELTA LOCUS; HBD 


HBE1 


GDB: 11 9299 


HEMOGLOBIN— EPS1LON LOCUS; HBE1 




HBG1 


GDB: 119300 


HEMOGLOBIN, GAMMA A; HBG1 




HBG2 


GDB: 119301 


HEMOGLOBIN, GAMMA G; HBG2 


35 


HMBS 


GDB:120528 


PORPHYRIA, ACUTE INTERMITTENT; AIP 


HND 


GDB:9954478 


HARTNUP DISORDER , •! 
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Gene 


GDB Accession D> 


OMIM Link 


HOMG2 


GDB:9956484 


MAGNESIUM WASTING, RENAL 


HRAS 


GDB: 120684 


BLADDER CANCER V-HA-RAS HARVEY 
RAT SARCOMA VIRAL ONCOGENE 
HOMOLOG; HRAS 


HVBS1 


GDB: 120069 


CANCER, HEPATOCELLULAR 


IDDM2 


GDB:128530 


DIABETES MELLITUS, 
INSULIN-DEPENDENT, 2 DIABETES 
MELLITUS, JUVENILE-ONSET 
INSULIN-DEPENDENT; EDDM 


IGER 


GDB.l 19696 


IgE RESPONSIVENESS, ATOPIC; IGER 


INS 


GDB:1 19349 


INSULIN; INS 


JBS 


GDB: 1201 11 


JACOBSEN SYNDROME; JBS 


KCNJ11 


GDB:7009893 


POTASSRJM CHANNEL, 
INWARDLY-RECTIFYING, SUBFAMILY J, 
MEMBER 11; KCNJ11 PERSISTENT 
HYPERINSULINEMIC HYPOGLYCEMIA OF 
INFANCY 


KCNJ1 


GDB:204206 


POTASSIUM CHANNEL, 
INWARDLY-RECTIFYING, SUBFAMILY J, 
MFMRFR 1- KCNT1 


KCNQ1 


GDB:741244 


LONG QT SYNDROME, TYPE 1; LQT1 


LDHA 


GDB: 120141 


LACTATE DEHYDROGENASE- A; LDHA 


LRP5 


GDB:9836818 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME; OPPG HIGH BONE MASS 


MEN1 


GDB:120173 


MULTIPLE ENDOCRINE NEOPLASIA, TYPE 
1;MEN1 


MLL 


GDB:128819 


MYELOID/LYMPHOID OR MIXED-LINEAGE 
LEUKEMIA; MLL 


MTACR1 


GDB: 125743 


MULTIPLE TUMOR ASSOCIATED 
CHROMOSOME REGION 1; MTACR1 


MYBPC3 


GDB:579615 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 4; CMH4 
MYOSIN-BINDING PROTEIN C, CARDIAC; 
MYBPC3 



10 
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35 
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Gene 


GDB Accession ID 


OMIM Link 


5 


MY07A 


GDB: 132543 


MYOSIN VIIA; MY07A DEAFNESS, 
NEUROSENSORY, AUTOSOMAL 
RECESSIVE, 2; DFNB2 DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 1 1 ; 
DFNA11 




JNINUl 


GDB- 104505 13 


SIMPLE, AUTOSOMAL DOMINANT 




OPPG 


GDB:3789438 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME; OPPG 


10 


OPTB1 


GDB:9954474 


OSTEOPETROSIS, AUTOSOMAL 
RECESSIVE 




rnAO 


ODB- 118997 


PAIRED BOX HOMEOTIC GENE 6; PAX6 




PP 


GDB- 11 9472 


PYRUVATE CARBOXYLASE DEFICIENCY 


i « 
i j 


pnvi 


GDB -983 6634 


PYRUVATE DEHYDROGENASE COMPLEX, 
COMPONENT X 






GDB-511177 

VJJLS.U**sJL lilt 


PARAGANGLIOMAS, FAMILIAL 
NONCHROMAFFIN, 2; PGL2 




PGR 


GDB: 11 9493 


PROGESTERONE RESISTANCE 


20 


PORC 


GDB:128610 


PORPHYRIA, CHESTER TYPE; PORC 




PTH 


GDB: 119522 


PARATHYROID HORMONE; PTH 




PTS 


GDB: 118856 


6-@PYRUVOYLTETRAHYDROPTERIN 
SYNTHASE; PTS 


25 


PVRL1 


GDB:583951 


ECTODERMAL DYSPLASIA, CLEFT LIP 
AND PALATE, HAND AND FOOT 
DEFORMITY, DYSPLASIA, MARGARITA 
TYPE POLIOVIRUS RECEPTOR RELATED; 
PVRR 

JL V J.NJL V 








GLYCOGEN STORAGE DISEASE V 


30 


KAOl 


\JUD. 1 <iUJ 


RFCOMBINATION ACTIVATING GENE-1: 
RAG1 




RAG2 


GDB: 125 186 


RECOMBINATION ACTIVATING GENE-2; 
RAG2 




ROM1 


GDB:120350 


ROD OUTER SEGMENT PROTEIN-1; ROM1 




SAA1 


GDB: 120364 


SERUM AMYLOID AT, SAA1 


35 


SCA5 


GDB:378219 


SPINOCEREBELLAR ATAXIA 5; SCA5 




> 
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Gene 


GDB Accession ID 


OM1M Link 


SCZD2 


GDB: 118874 


DISORDER-2; SCZD2 


SDHD 


GDB: 132456 


PARAGANGLIOMAS, FAMILIAL 
NUNCrLKUMArrlN, 1, ruLl 


SERPING1 


GDB: 119041 


ANGIONEUROTIC EDEMA, HEREDITARY; 
HANE 


SMPD1 


GDB: 128 144 


NIEMANN-PICK DISEASE 


TCIRG1 


GDB:9956269 


OSTEOPETROSIS, AUTOSOMAL 






T FTIKFMTA. ACUTE T-CELL* ATL 


TECTA 


GDB:6837718 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL. 8; 
DFNA8 DEAFNESS, AUTOSOMAL 

SENSORINEURAL, 12; DFNA12 


TH 


GDB:1 19612 


TYROSINE HYDROXYLASE; TH 


TREH 


GDB:9958953 


TD T7XJ AT ACT? 

1 KiiHALAoii 


TSG101 


GDB:1313414 


TUMOR SUSCEPTIBILITY GENE 101; 

Tor 1 A1 


TYR 


GDB:120476 


ALBINISM I 


USH1C 


GDB: 132544 


USHER SYNDROME, TYPE IC; USH1C 


\/TVyfTY> 
V JVIL/Z 


otyr«i ^^70^ 
vJi/D .i j j /yo 


VTTFT T TFORM MACULAR DYSTROPHY: 
VMD2 


VRNI 

V J. VI NX 


GDB 1 35662 


VTTREORETINOPATHY, NEOVASCULAR 
INFLAMMATORY; VRNI 


WT1 


GDB: 120496 


FRASIER SYNDROME WILMS TUMOR; 
WT1 


WT2 


GDB: 118886 


MULTIPLE TUMOR ASSOCIATED 
CHROMOSOME REGION 1; MTACR1 


ZNF145 


GDB:230064 


PROMYELOC YTIC LEUKEMIA ZINC 
FINGER; PLZF 



Table 13: Genes, Locations and Genetic Disorders on Chromosome 12 



Gene 


GDB Accession ID 


OMIMLink 


A2M 


GDB:119639 


ALPHA-2-MACROGLOBULIN; A2M 
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Gene 


GDB Accession ID 


OMIM Link 


AAAS 


GDB-9954498 


GLUCOCORTICOID DEFICIENCY 
AND ACHALASIA 


AC ADS 


GDB* 11 8959 


ACYL-CoA DEHYDROGENASE, 
SHORT-CHAIN; ACADS 


ACLS 


GDB* 136346 


ACROCALLOSAL SYNDROME; ACLS 


ACVRL1 


GDB:230240 


OSLER-RENDU-WEBER SYNDROME 
2; ORW2 ACTTVIN A RECEPTOR, 
TYPE II-LIKE KINASE 1; ACVRL1 


ADHR 


GDB:9954488 


VITAMIN D-RESISTANT RICKETS, 
AUTOSOMAL DOMINANT 


ALDH2 


GDB: 119668 


ALDEHYDE DEHYDROGENASE-2; 
ALDH2 


AMHR2 


GDB.696210 


ANTI-MULLERIAN HORMONE TYPE 
II RECEPTOR; AMHR2 


AOM 


GDB: 118998 


STICKLER SYNDROME, TYPE I; STL1 


AQP2 


GDB: 141 853 


AQUAPORIN-2; AQP2 DIABETES 
INSIPIDUS, RENAL TYPE DIABETES 
INSIPIDUS, RENAL TYPE, 
AUTOSOMAL RECESSIVE 


ATD 


GDB -696353 


ASPHYXIATING THORACIC 
DYSTROPHY; ATD 


ATP2A2 


GDB:1 19717 


ATPase, Ca(2+)-TRANSPORTING, 
SLOW-TWITCH; ATP2A2 








C1R 


GDB: 119729 


COMPLEMENT COMPONENT-Clr, 
DEFICIENCY OF 


CD4 


GDB: 119767 


T-CELL ANTIGEN T4/LEU3; CD4 


CDK4 


GDB-.204022 


CYCLIN-DEPENDENT KINASE 4; 
CDK4 


CNA1 


GDB:252 H9 


CORNEA PLANA 1; CNA1 


COL2A1 


GDB: 119063 


STICKLER SYNDROME, TYPE I; STL1 
COLLAGEN, TYPE H, ALPHA- 1 
CHAIN; COL2A1 

ACHONDROGENESIS, TYPE II; ACG2 



10 



15 



20 



25 



30 



35 



-96- 



NY2- 1321355.1 





Gene 


GDB Accession ID 


OMIM Link 




CYP27B1 


GDB:9835730 


> SEUDOVITAMIN D DEFICIENCY 
RICKETS; PDDR 


5 


DRPLA 


GDB:270336 


i— w-i— »-v rrm\ a rrAn t tt"* f% a T T"> ATT TT*\/^\T T rVCT A. 

DENTATORUBRAL-P ALLIDOLU YolA 
N ATROPHY; DRPLA ! 




ENUR2 


GDB.666422 


ENURESIS, NOCTURNAL, 2; ENUR2 




FEOM1 r 


GDB:345037 


FIBROSIS OF EXTRAOCULAR 
MUSCLES, CONGENITAL; FEOM 


10 


FPF 


GDB:9848880 


PERIODIC FEVER, AUTOSOMAL 
DOMINANT 




GNB3 


GDB: 120005 


GUANINE NUCLEOTIDE-BINDING 
PROTEIN, BETA POLYPEPTIDE-3; 
GNB3 


15 


GNS 


GDB: 120006 


MUCOPOLYSACCHARIDOSIS Iirfc, 
HID 




HAL 


GDB: 120746 


HISTIDINEMIA 




HBP1 


GDB:701889 


B R ACH YD ACTYL Y WITH 
HYPERTENSION 


20 


HMGIC 


GDB:362658 


HIGH MOBILITY GROUP PROTEIN 
ISOFORM I-C; HMGIC 




HMN2 


GDB:9954508 


MUSCULAR ATROPHY, ADULT 
SPINAL 




HPD 


GDB: 135978 


TYROSINEMIA, TYPE III 


25 


IGF1 


GDB:120081 


INSULINLIKE GROWTH FACTOR 1; 
IGF1 




KCNA1 


GDB:127903 


POTASSRJM VOLTAGE-GATED 
CHANNEL, SHAKER-RELATED 
SUBFAMILY, MEMBER 




KERA 


GDB:252121 


CORNEA PLANA 2; CNA2 


30 


KRAS2 


GDB: 120 120 


it T/-Y r> A CO VTOCFCM 1? AT QAP POM A 

2 VIRAL ONCOGENE HOMOLOG; 
KRAS2 




KRT1 


GDB:128198 


KERATIN 1; KRT1 




KRT2A 


GDB:407640 


ICHTHYOSIS, BULLOUS TYPE 
KERATIN 2A; KRT2A 


35 


KRT3 


GDB: 136276 


KERATIN 3; KRT3 
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6D398 



Gene 



KRT4 



I KRT5 



| KRT6A 
KRT6B 



10 



KRTHB6 
LDHB 



15 



|LYZ 

MGCT 
MPE 
MVK 



20 MYL2 



I NS1 
OAP 



25 



PAH 



30 



PPKB 
PRB3 
PXR1 
RLS 



, RSN 



35 



SAS 



GDB Accession ID 



GDB: 120697 



GDB: 1281 10 



GDB:128111 
GDB: 1281 13 



GDB:702078 
I GDB-.120147 



GDB: 120160 



GDB:9954504 
G DB: 1201 91 
GDB: 134189 



GDB: 128829 



GDB:439388 
GDB: 120245 



GDB: 119470 



GDB-.696352 
GDB:1 19513 
GDB:433739 
GDB:11501392 



OMBMLink 



KERATIN 4; KRT4 



EPIDERMOLYSIS BULLOSA 
HERPETIFORMIS, DOWLING-MEARA 
TYPE KERATIN 5;KRT5 



KERATIN 6A; KRT6A 

KERATIN 6B; KRT6B 
PACHYONYCHIA CONGENITA, 
JACKSON-LAWLER TYPE 



MONILETHRIX KERATIN, HAIR 
BASIC (TYPE ID 6 

LACTATE DEHYDROGENASE-B; 
LDHB 



AMYLOIDOSIS, FAMILIAL VISCERAL 
I LYSOZYME; LYZ 



! TE STICULAR TUMORS 
MALIGNANT PROLIFERATION OF 
MEVALONICACIDURIA 



MYOSIN, LIGHT CHAIN, 
REGULATORY VENTRICULAR; 
MYL2 



[ NO ONAN SYNDROME 1; NS1 

OSTEOARTHROSIS, PRECOCIOUS; 
OAP 



PHENYLKETONURIA; PKU1 

PALMOPLANTAR KERATODERMA, 
BOTHNIAN TYPE; PPKB 

PAROTID SALIVARY 
GL YCOPROTEIN; Gl 

ZELLWEGER SYNDROME; ZS 
PEROXISOME RECEPTOR 1; PXR1 

ACROMELALGIA, HEREDITARY 



GDB:139158 



j RESTIN; RSN 



GDB: 128054 



SARCOMA AMPLIFIED SEQUENCE; 
SAS 
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20 



Gene 


GDB Accession ID 


OMIMLink 


SCA2 ! 


GDB: 128034 


SPINOCEREBELLAR ATAXIA 2; SCA2 
ATAXIN-2; ATX2 


SCNN1A 


GDB:366596 


SODIUM CHANNEL, 
NONVOLTAGE-GATED, 1; SCNN1A 


SMAL 


GDB:9954506 


SPINAL MUSCULAR ATROPHY, 
CONGENITAL NONPROGRESSIVE, 
OF LOWER LIMBS 


SPPM 


GDB:9954502 


SCAPULOPERONEAL MYOPATHY; 
SPM 


SPSMA 


GDB:9954510 


SCAPULOPERONEAL 
AMYOTROPHY, NEUROGENIC, NEW 
ENGLAND TYPE 


TBX3 


GDB:681969 


ULNAR-MAMMARY SYNDROME; 
UMS T-BOX 3; TBX3 


TBX5 


GDB:6175917 


HOLT-ORAM SYNDROME; HOS 
T-BOX 5; TBX5 


TCF1 


GDB:125297 


TRANSCRIPTION FACTOR 1, 
HEPATIC; TCF1 MATURITY-ONSET 
DIABETES OF THE YOUNG, TYPE III; 
MODY3 


TPI1 


GDB:1 19617 


TRIOSEPHOSPHATE ISOMERASE 1; 
TPI1 


TSC3 


GDB: 127930 


SCLEROSIS-3; TSC3 


ULR 


GDB:594089 


UTERINE 


VDR 


GDB:120487 


VITAMIN D-RESISTANT RICKETS 
WITH END-ORGAN 
UNRESPONSIVENESS TO 
1 ,25-DIHYDROXYCHOLECALCIFERO 
L VITAMIN D RECEPTOR; VDR 


VWF 


GDB:1 19125 


VON WILLEBRAND DISEASE; VWD 



30 

Table 14: Genes, Locations and Genetic Disorders on Chromosome 13 
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Gene 


GDB Accession ID 


OMIM Link 


ATP7B 


GDB: 120494 


WILSON DISEASE; WND 


BRCA2 


GDB:387848 


BREAST CANCER 2, EARLY-ONSET; 
BRCA2 
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ibOSQS^PE! ,072*405 



Gene I GDB Accession ID 

BRCD1 I GDB:9954522 
CLN5 I GDB:230991 



OMIMLink 

BREAST CANCER, DUCTAL, 1; BRCD1 

CEROID-LIPOFUSCNOSIS, NEURONAL 5; 
CLN5 



CPB2 



ED2 



GDB: 1 29546 



CARBOXYPEPTIDASE B2, PLASMA; CPB2 



GDB:9834522 



ECTODERMAL DYSPLASIA, HIDROTIC; 
HED 



EDNRB 



10 



GDB: 129075 



ENDOTHELIN-B RECEPTOR; EDNRB 
HIRSCHSPRUNG DISEASE-2; HSCR2 



ENUR1 



GDB:594516 



ENURESIS, NOCTURNAL, 1 ; ENUR1 



ERCC5 



GDB: 1205 15 



EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 5; 



15 



F10 1 


(jDiJ:iiyoyu % 


X QUANTITATIVE VARIATION IN 
FACTOR X DEFICIENCY; F10 


F7 1 


GDB:liyoy/ | 


FACTOR VII DEFICIENCY 


GJB2 1 


GDB:125247 1 


GAP JUNCTION PROTEIN, BETA-2, 26 KD; 
GJB2 DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 1 ; DFNB1 
DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 3; 
DFNA3 


1 GJB6 1 


GDB:9958357 | 


ECTODERMAL DYSPLASIA, HIDROTIC; 

HED DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 3; DFNA3 \ 


1 IPF1 


GDB:448899 


1 INSULIN PROMOTER FACTOR 1 ; EPF1 j 


1 MBS1 


GDB: 128365 


MOEBIUS SYNDROME; MBS 


| MCOR 


| GDB:9954520 


1 CONGENITAL | 


1 PCCA 


| GDB: 119473 


1 GLYCINEMIA, KETOTIC, I 


1 RBI 


1 GDB: 11 8734 


BLADDER CANCER RETINOBLASTOMA; 
RBI 


| RHOK 


| GDB:371598 


RHODOPSIN KINASE; RHOK J 


| SCZD7 


| GDB:9864734 


| DISORDER-2; SCZD2 1 


1 SGCG 


1 GDB:3763329 


MUSCULAR DYSTROPHY, LIMB GIRDLE, 
1 TYPE 2C; LGMD2C | 
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Gene 


GDB Accession ID 


OMIM Link 


SLC10A2 


GDB.677534 


SOLUTE CARRIER FAMILY 10, MEMBER 
2; SLC10A2 


SLC25A15 


GDB: 120042 


HYPERORMTHINEMIA-HYPERAMMONE 
M1A-HOMOCITRULLINURIA SYNDROME 


STARP1 


GDB:635459 


STEROIDOGENIC ACUTE REGULATORY 
PROTEIN; STAR 


ZNF198 


GDB:6382650 


ZINC FINGER PROTEIN-198; ZNF198 



Table 15: Genes, Locations and Genetic Disorders on Chromosome 14 
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Gene 


GDB Accession ID 


OMIM Link 


ACHM1 


GDB:132458 


COLORBLINDNESS, TOTAL 


ARVD1 


GDB-.371339 


ARRHYTHMOGENIC RIGHT VENTRICULAR 
DYSPLASIA, FAMILIAL, 1; ARVD1 


CTAA1 


GDB:265299 


CATARACT, ANTERIOR POLAR 1; CTAA1 


DAD1 


GDB:407505 


DEFENDER AGAINST CELL DEATH; DAD1 


DFNB5 


GDB:636176 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 5; DFNB5 


EML1 


GDB:6328385 


USHER SYNDROME, TYPE IA; USH1A 


GALC 


GDB: 119970 


KRABBE DISEASE 


GCH1 


GDB: 11 8798 


DYSTONIA, PROGRESSIVE, WITH DIURNAL 
VARIATION GTP CYCLOHYDROLASE I 
DEFICIENCY GTP CYCLOHYDROLASE I; 
GCH1 


HE1 


GDB:9957680 


MALFORMATIONS, MULTIPLE, WITH LIMB 
ABNORMALITIES AND HYPOPITUITARISM 


IBGC1 


GDB: 10450404 


CEREBRAL CALCIFICATION, 
NONARTERIOSCLEROTIC 



30 
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10 



15 



20 



25 



30 



Gene < 


3DB Accession ID | < 


3MIM Link 


IGH@ 1 < 


3DB:1 18731 |l 

] 

] 

1 


[gA CONSTANT HEAVY CHAIN 1 ; IGHA1 
[MMUNO GLOBULIN: D (DIVERSITY) 
REGION OF HEAVY CHAIN IgA CONSTANT 
HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: J (JOINING) LOCI OF 
HEAVY CHAIN; IGfcU IMMUJNUuludu^. i 
HEAVY Mu CHAIN; Mul ; IGHM1 
IMMUNOGLOBULIN: VARIABLE REGION 
OF HEAVY CHAINS-Hvl ; IGHV IgG HEAVY 
CHAIN LOCUS; IGHG1 IMMUNOGLOBULIN 
Gm-2- IGHG2 IMMUNOGLOBULIN Gm-3; | 
IGHG3 IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA 
CHAIN; IGHD IMMUNOGLOBULIN: HEAVY 

EPSILON CHAIN; IGHE 


i IGHC group 


GDB:9992632 


IeA CONSTANT HEAV Y L-rlAIN i , ivxtla. i 
IrA CONSTANT HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: HEAVY Mu CHAIN, 
Mul; IGHM1 IgG HEAVY CHAIN LOCUS; 
IGHG1 IMMUNOGLOBULIN Gm-2; IGHG2 
IMMUNOGLOBULIN Gm-3; IGHG3 
IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA 
rHATM' TfTHT) IMMUNOGLOBULIN: HEAVY 1 
| EPSILON CHAIN; IGHE j 


| IGHG1 


GDB:120085 


I T«fi hpavy CHAIN LOCUS; IGHG1 ! 


llGHM 


GDB: 120086 


1 TA/nwfT TMOGT OBULIN* HEAVY Mu CHAIN; 
Mul;IGHMl J 


| IGHR 


1 GDB:9954529 


| G1(A1) SYNDROME | 


1 W 


| GDB: 139274 


1 INVERSUS VISCERUM 


1 LTBP2 


1 GDB:453890 


LATENT TRANSFORMING GROWTH 
1 FACTOR-BETA BlINL/lJN^jr j'KXJiJiiiN i^i-D-r^ 


1 MCOP 


1 GDB:9954527 


1 MICROPHTHALMOS 


1 MJD 


| GDB:1 18840 


1 MACHADO-JOSEPH DISEASE; MJD 


| MNG1 


| GDB.6540062 


1 GOITER, MULTINODULAR 1; MNG1 


| MPDl 


GDB:230271 


1 MYOPATHY, LATE DISTAL HEREDITARY 


|MPS3C 


GDB:9954532 


MUCOPOLYSACCHARIDOSIS TYPE IUC 


1 MYH6 


GDB:120214 


1 MYOSIN, HEAVY POLYPEPTIDE 6; MYH6 



35 
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j Gene 


GDB Accession ID 


OMIM Link j 


MYH7 


GDB- 12091 ^ 


MYOMN, CARDIAC, HEAVY CHAIN, BETA; 
MYH7 


I NP 


GDR-1909TQ 


NUCLEOSIDE PHOSPHOR YLASE; NP 


PABPN1 


VJJL/JJ.^/V/ f U J 


UCULOPHARYNGEAL MUSCULAR 
DYSTROPHY; OPMD OCULOPHARYNGEAL 
MUSCULAR DYSTROPHY, AUTOSOMAL 
RECESSIVE POLY ADEN YLATE-BINDING 
PROTEIN-2; PABP2 


PSEN1 


GDB: 135682 


ALZHEIMER DISEASE, FAMILIAL, TYPE 3- 
AD3 


J PYGL 


GDB: 120328 


GLYCOGEN STORAGE DISEASE VI 


I RPGRIP1 


GDB: 11498766 


AMAUROSIS CONGENITA OF LEBER I ! 


| SERPINAl 


GDB: 120289 


PROTEASE INHIBITOR 1 ; PI 


| SERPINA3 


GDB: 118955 


ALPHA- 1 -ANTICHYMOTRYPSIN; AACT 


1 SERPEN A6 


GDB: 127865 


CORTICOSTEROID-BINDING GLOBULIN; 
CBG 


1 9r C7A7 


GUJt»:yo63033 


DIBASICAMINO ACIDURIA II 


1 SPG3A 


GDB:230126 


SPASTIC PARAPLEGIA-3, AUTOSOMAL 
DOMINANT; SPG3A 


SPTB 


GDB: 119602 


ELLIPTOCYTOSIS, RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC 
SPECTRIN, BETA, ERYTHROCYTIC; SPTB 




GDB:250785 


T-CELL LYMPHOMA OR LEUKEMIA 


TCRAV17S1 


GDB:642130 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TCRAV5S1 


GDB:451966 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TGM1 


GDB.125299 


TRANSGLUTAMINASE 1; TGM1 


TITF1 


GDB: 132588 


THYROID TRANSCRIPTION FACTOR 1; 
TITF1 


TMIP 


GDB.-9954523 


AND ULNA, DUPLICATION OF, WITH 
ABSENCE OF TIBIA AND RADIUS 


TRA@ 


GDB.120404 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 
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Gene 


GDB Accession ID 


OMIM Link 


TSHR 


GDB:125313 


THYROTROPIN, UNRESPONSIVENESS TO 


USH1A 


GDB:118885 


USHER SYNDROME, TYPE IA; USH1A 


VP 


GDB: 120492 


PORPHYRIA V ARIEGATA 



Table 16: Genes, Locations and Genetic Disorders on Chromosome 15 
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Gene 


GDB Accession ID 


OMIM Link 


ACCPN 


GDB:5457725 


CORPUS CALLOSUM, AGENESIS OF, 
WITH NEURONOPATHY 


AH02 


GDB:9954535 


HEREDITARY OSTEODYSTROPHY-2; 
AH02 


ANCR 


GDB: 119678 


ANGELMAN SYNDROME 


B2M 


GDB: 119028 


BETA-2-MICROGLOBULIN; B2M 


BBS4 


GDB:511199 


BARDET-BIEDL SYNDROME, TYPE 4; 
BBS4 


BLM 


GDB: 135698 


BLOOM SYNDROME; BLM 


CAPN3 


GDB:1 19751 


CALPAIN, LARGE POLYPEPTIDE L3; 
CAPN3 MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2; LGMD2 


CDAN1 


GDB-.9823267 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE I 


CDAN3 


GDB:386192 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE III; CDAN3 


CLN6 


GDB:4073043 


CERODD-LIPOFUSCINOSIS, NEURONAL 6, 
LATE INFANTILE, VARIANT; CLN6 


CMH3 


GDB: 138299 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 3; CMH3 


CYP19 


GDB: 119830 


CYTOCHROME P450, SUBFAMILY XIX; 
CYP19 


CYP1A1 


GDB: 120604 


CYTOCHROME P450, SUBFAMILY I, 
POLYPEPTIDE 1; CYP1A1 


CYP1A2 


GDB: 118780 ] 


CYTOCHROME P450, SUBFAMILY I, 
POLYPEPTIDE 2; CYP1 A2 


DYX1 


GDB: 1391 796 


DYSLEXIA, SPECIFIC, 1; DYX1 



- 104- 
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Gene 


GDB Accession ID 


OMIM Link 




EPB42 


GDB- 127385 


RFR FDTT A R Y HFMOT .YTTC PROTFTNJ 4 7 
ERYTHROCYTIC; EPB42 


5 


ETFA 


GDB:119121 


GLUTARICACIDURIA IIA; GA IIA 




EYCL3 


GDB:4590306 


EYE COLOR-3; EYCL3 




FAH 


GDB: 11 9901 


TYROSINEMIA, TYPE I 




FBN1 


GDB: 1271 15 


FIBRILLIN-!; FBN1 MARFAN 
SYNDROME; MFS 


FES 


GDB: 11 9906 


V-FES FELINE SARCOMA VTRAL/V-FPS 
FUJTNAMI AVIAN SARCOMA VIRAL 

V^* 1 > V — \J V_J llj 1\JLV 




HCVS 


GDR-1 19^06 


POP CVM A VTRT 990P T^PFPTTRTT TTV- 

CVS 


15 


HEXA 


GDB: 120040 


TAY-SACHS DISEASE; TSD 




IVD 


GDB: 119354 


ISOVALERICACIDEMIA; IVA 




LCS1 


GDB: 11500552 


CHOLESTASIS-LYMPHEDEMA 
bYNDKOMb 




LIPC 


GDB: 119366 


LIPASE, HEPATIC; LIPC 


20 


MY05A 


GDB:218824 


MYOSIN VA; MY05A 




OCA2 


GDR- 136870 


AT RTMTSM TT 




OTSC1 


GDB:9860473 


OTOSCLEROSIS 




PWCR 


GDB: 120325 


PRADER-WILLI SYNDROME 


25 


RLBP1 


GDB: 127341 


RETIN ALDEHYDE-BINDING PROTEIN 1,; 
RLBP1 




SLC12A1 


GDB.-386121 


SOLUTE CARRIER FAMILY 12. MEMBER 
1; SLC12A1 


30 


SPG6 


GDB:511201 


SPASTIC PARAPLEGIA 6, AUTOSOMAL 
DOMINANT; SPG6 




TPM1 


GDB: 127875 


TROPOMYOSIN 1; TPM1 




UBE3A 


GDB-.228487 


ANGELMAN SYNDROME 
UBIQUITIN-PROTEIN LIGASE E3 A; 
UBE3A 


35 


WMS 


GDB:5583902 


WEILL-MARCHESANI SYNDROME 
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Table 17: 



Genes, Locations and Genetic Disorders on Chromosome 16 



Gene 


GDB Accession ID 


OMIM Link 


ABCC6 


GDB:9315106 


PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL DOMINANT; PXE 
PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL RECESSIVE- PXE 


ALDOA 


GDB: 118993 


ALDOLASE A, FRUCTOSE-BISPHOSPHATE; 
ALDOA 


APRT 


GDB: 11 9003 


ADENINE 

PHOSPHORIBOSYLTRANSFERASE- APRT 


ATP2A1 


GDB:1 19716 


ATPase Caf2+VTRANSPORTING 
FAST-TWITCH 1; ATP2A1 BRODY 
MYOPATHY 


BBS2 


GDB:229992 


BARDET-BIEDL SYNDROME, TYPE 2; BBS2 


CARD15 


GDB: 11 026232 


SYNOVITIS, GRANULOMATOUS, WITH 
UVEITIS AND CRANIAL NEUROPATHIES 


CATM 


GDB:701219 


MICROPHTHALMIA-CATARACT 


CDH1 


GDB.120484 


CADHERIN 1; CDH1 


CETP 


GDB:1 19773 


CHOLESTERYL ESTER TRANSFER 

PP OTP TNT PT AQISVTA* r*T?TT> 
rtsxj i dun, JrLf/\olVl/\, v^xil r 


CHST6 


GDB:131407 


CORNEAL DYSTROPHY, MACULAR TYPE 


CLN3 


GDB: 120593 


CEROID-LIPOFUSCINOSIS NEURONAL 3 
JUVENILE; CLN3 


CREBBP 


GDB :437159 


RUBINSTEIN SYNDROME CREB-BINDING 
PROTEIN; CREBBP 


CTH 


GDB.1 19086 


CYSTATHIONINURIA 


CTM 


GDB:1 19819 


CATARACT, ZONULAR 


CYBA 


GDB:125238 


GRANULOMATOUS DISEASE, CHRONIC, 
AUTOSOMAL CYTOCHROME-b-NEGATTVE 
FORM 


CYLD 


GDB:701216 


EPITHELIOMA, HEREDITARY MULTIPLE 
BENIGN CYSTIC 


DHS 


GDB:9958268 


XEROCYTOSIS, HEREDITARY 


DNASE1 


GDB: 132846 


DEOXYRIBONUCLEASE I; DNASE1 
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Gene 



DPEP1 
ERCC4 



I FANCA 



10 



GALNS 
GAN 
HAGH 

Ihbai 



15 I HBA2 



20 



HBHR 

HBQ1 
HBZ 
| HBZP 
HP 



i GDB Accession ID OMIM Link 



25 



HSD11B2 
IL4R 



LIPB 



i MC1R 



MEFV 



GDB: 128059 
GDB:1 19113 

GDB:701221 



30 MHC2TA 



35 



MLYCD 
PHKB 



| GDB-.129085 
GDB:9864885 
GDB: 11 9292 

GDB: 11 9293 



GDB:9954541 

GDB:120036 
GDB: 119302 
GDB: 120037 



GDB.409951 
GDB: 118823 



GDB: 119365 



GDB: 135 162 



GDB:125263 



GDB:6268475 



, RENAL DIPEPTIDASE 

EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 4; 
ERCC4 XERODERMA PIGMENTOSUM, 
COMPLEMENTATION GROUP F; XPF 

FANCONI ANEMIA, COMPLEMENTATION 

GROUP A; FACA 

M UCOPOLYSACCHARIDOSIS TYPE IVA 

i NEUROPATHY, GIANT AXONAL; GAN 



HYDROXY ACYL GLUTATHIONE 
I HY DROLASE; HAGH 

HEMOGLOBIN-ALPHA LOCUS-1; HBA1 



GDB:1 19294 HEMOGLOBIN-ALPHA LOCUS-2; HBA2 



I HEMOGLOBIN H-RELATED MENTAL 
| RETARDATION 

H EMOGLOBIN— THET A- 1 LOCUS; HBQ1 
H EMOGLOBIN— ZETA LOCUS; HBZ 
HF.MOGLOBIN-ZETA LOCUS; HBZ 



GDB: 1 193 14 HAPTOGLOBIN; HP 



CORTISOL 1 1-BETA-KETOREDUCTASE 
DEFICIENCY 

INTERLEUKIN-4 RECEPTOR; IL4R 



GDB:1 1500940 
GDB: 120286 



LIPASE B, LYSOSOMAL ACID; LIPB 



MELANOCORTIN-1 RECEPTOR; MC1R 



MEDITERRANEAN FEVER, FAMILIAL; 
MEFV 



MHC CLASS n TRANSACTIVATOR; 
MHC2TA 



MALONYL CoA DECARBOXYLASE 
DEFICIENCY 

PHOSPHORYLASE KINASE, BETA 
SUBUNIT; PHKB 
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25 



Gene 


GDB Accession ID 


OMIM Link 


PHKG2 


GDB:140316 


PHOSPHORYLASE KINASE, TESTIS/LIVER, 
GAMMA 2; PHKG2 


PKD1 


GDB: 120293 


POLYCYSTIC KIDNEYS POLYCYSTIC 
KIDNEY DISEASE 1; PKD1 


PKDTS 


GDB:9954545 


POLYCYSTIC KIDNEY DISEASE, INFANTILE 
WVTPPP WITH TTTP.FROTTS SCT FROSTS* 

O DV "• **■ ~-j Willi lUDl'IvvUJ O^L/ijiVVyuiu, 


PMM2 


GDB:438697 


CARBOHYDRATE-DEFICIENT 

fiT VfTVPR OTFTNT SYNDROMR TYPE I' CDGl 

PHOSPHOMANNOMUTASE 2; PMM2 


PXE 


GDB:6053895 


PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL DOMINANT; PXE 

T*OT7T tt"\/"W A \TTT T/"\X X A T7T A CTTPT TK A 

AUTOSOMAL RECESSIVE; PXE 


SALL1 


GDB:4216161 


TOWNcb-BROLNS oYJNUKUMli; Itia 
SAL-LIKE 1; SALL1 


SCA4 


GDB:250364 


SPINOCEREBELLAR ATAXIA 4; SCA4 


SCNN1B 


GDB:434471 


SODIUM CHANNEL, NONVOLTAGE-GATED 
1 BETA; SCNN1B 


SCNN1G 


GDB:568759 


SODIUM CHANNEL, NONVOLTAGE-GATED 
1 GAMMA; SCNN1G 


TAT 


GDB: 120398 


TYROSINE TRANSAMINASE DEFICIENCY 


TSC2 


GDB: 120466 


TUBEROUS SCLEROSIS-2; TSC2 


VDI 


GDB: 119629 


DEFECTIVE INTERFERING PARTICLE 
INDUCTION, CONTROL OF 


WT3 


GDB:9958957 


WILMS TUMOR, TYPE III; WT3 



Table 18: Genes, Locations and Genetic Disorders on Chromosome 17 



Gene 


GDB Accession ID 


OMIM Link 


ABR 


GDB:1 19642 


ACTIVE BCR-RELATED GENE; ABR 


ACACA 


GDB: 120534 


ACETYL-CoA CARBOXYLASE 
DEFICIENCY 


ACADVL 


GDB: 1248 185 


ACYL-CoA DEHYDROGENASE, 
VERY-LONG-CHAIN, DEFICIENCY OF 


ACE 


GDB:1 19840 


DIPEPTIDYL CARBOXYPEPTIDASE-1; 
DCP1 
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83 cms 



B2ES .072H0S 





Gene 


GDB Accession ID i 


OMEVI Link 




ALDH3A2 


GDB.13 16855 


SJOGREN-LARSSON SYNDROME; SLS 1 




APOH 


GDB:1 18887 


APOLIPOPROTEIN H; APOH I 


5 " 


ASPA 


GDB:231014 


SPONGY Dh.OJiJNJb.KAl JVJJN \Jr i^hin 1 i 
NERVOUS SYSTEM | 




AXIN2 


GDB:9864782 


CANCER OF COLON I 




BCL5 


GDB:125178 


LEUKEMIA/LYMPHOMA, CHRONIC | 
B-CELL, 5; BCL5 


10 


BHD 


GDB: 11498904 


WITH TRICHODISCOMAS AND 
ACROCHORDONS 




BLMH 


GDB:3801467 


BLEOMYCIN HYDROLASE j 




BRCA1 


GDB:126611 


BREAS1 CAlNCiiK, 1 YJrJti l, dklai j 


15 


CACD 


GDB-.5885801 


CHOROIDAL DYSTROPHY, CENTRAL 

A T» r7/~\T AD. A /^T\ 1 

AREOLAR; CACJJ | 




CCA1 


GDB: 118763 


CATARACT, CONGENITAL, CERULEAN 1 
TYPE 1; CCA1 ] 




CCZS 


GDB:681973 


CATARACT, CONGENITAL ZONULAR, 1 
WITH SUTURAL OPACITIES; CCZS 


20 


CHRNB1 




PHOT TNTFRfrTC RFCEPTOR NICOTINIC, 
BETA POLYPEPTIDE 1 ; CHRNB1 | 




CHRNE 


GDB: 132246 


CHOLINERGIC RECEPTOR, NICOTINIC, I 
EPSILON POLYPEPTIDE; CHRNE 


25 


CMT1A 


GDB: 119785 


CHARCOT-MARIE-TOOTH DISEASE, 1 
TVPF 1 A • PMT1 A NEUROPATHY. 1 
HEREDITARY, WITH LIABILITY TO 
PRESSURE PALSIES; HNPP 




COL1A1 


GDB-.119061 


COLLAGEN, TYPE I, ALPHA- 1 CHAIN; 1 
POT 1A1 OSTEOGENESIS IMPERFECTA 
TYPE I OSTEOGENESIS IMPERFECTA 
TYPE IV; OI4 


30 


CORD5 


GDB:568473 


CONE-ROD DYSTROPHY-5; CORD5 




CTNS 


GDB:700761 


CYSTINOSIS, EARLY-ONSET OR 1 
INFANTILE NEPHROPATHIC TYPE ( 




EPX 


GDB.-377700 


EOSINOPHIL PEROXIDASE; EPX J 



35 
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o 72 J wa 



GDB Accession ID I OMEM Link 



ERBB2 1 


GDB:120613 


V-ERB-B2 AVIAN ERYTHROBLASTIC | 
t PT TK^MT A VIRAL ONCOGENE ! 
HOMOLOG 2; ERBB2 


G6PC 1 


GPB:231927 1 


GLYCOGEN STORAGE DISEASE I; GSD-I 


GAA 1 


GDB:1 19965 1 


GLYCOGEN STORAGE DISEASE II 


GALK1 


GDB: 119246 


GALACTOKIN ASE DEFICIENCY 


GCGR ! 


1 GDB:304516 


GLUCAGON RECEPTOR; GCGR 


GFAP 


1 GDB: 11 8799 1 


GLIAL FIBRILLARY ACIDIC rKU i win, 
GFAP ALEXANDER DISEASE 


1 GH1 


| GDB: 11 9982 J 


| GROWTH HORMONE 1; GH1 


| GH2 


1 GDB: 119983 


1 GROWTH HORMONE 2; GH2 


1 GP1BA 


1 GDB:1 18806 


GIANT PLATELET SYNDROME 


| GPSC 


1 GDB:9954564 


FAMILIAL PROGRESSIVE SUBCORTICAL 


1 GUCY2D 


I GDB-.136012 


AMAUROSIS CONGENITA OF LEBER I 
GUANYLATE CYCLASE 2D, MEMBRANE; 



10 



15 



20 



ITGA2B 
ITGB3 



GDB:120012 
I GDB:120013 



CORD6 

| THROMBASTHENIA OF GLANZMANN 
| AND NAEGELI 

INTEGRIN, BETA-3;ITGB3 



ITGB4 



25 



KRT10 
KRT12 



KRT13 



30 



KRT14 



KRT14L1 



KRT14L2 



GDB:128028 



I INTEGRIN, BETA-4; ITGB4 



GDB: 118828 
GDB-.5583953 



KERATIN 10; KRT10 

CORNEAL DYSTROPHY, JUVENILE 
EPITHELIAL, OF MEESMANN KERATIN 
12; KRT12 



GDB: 120740 



KERATIN 13;KRT13 



GDB:132145 



| KERATIN 14; KRT14 GLUTATHIONE 
SYNTHETASE; GSS 



GDB: 120121 



KERATIN 14; KRT14 



GDB: 120122 



KERATIN 14; KRT14 



KRT14L3 



GDB: 120123 



KERATIN 14; KRT14 



35 KRT16 



GDB: 136207 



KERATIN 16; KRT16 
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Gene | 


GDB Accession ID 


OMIM Link 




KRT16L1 


GDB: 120125 


KERATIN 16; KRT16 1 




KRT16L2 i 


GDB: 120126 


KERATIN 16; KRT16 1 


5 r 


KRT17 1 


GDB:136211 


KERATIN 17; KRT17 PACHYONYCHIA 1 
rnMOFNTTA JACKSON-LA WLER TYPE 




KRT9 


GDB:303y /U 


wvPFttKFRATOSIS LOCALIZED 1 
EPIDERMOLYTIC _| 


10 


MAPT 1 


GDB.l 19434 


MICROTUBULE-ASSOCIATED PROTEIN 1 
TAU; MAPT PALLIDOPONTONIGRAL 

TTBflTTWTJ ft A TION • PPND ! 

DISINMBITION-DEMENTIA-PARKINSONI 
SM-AMYOTROPHY COMPLEX; DDPAC 




MDB J 


GDB:9958959 


MEDULLOBLASTOMA; MDB 1 


15 


MDCR | 


GDB:12052D 


Turn t FR.nTFKER LISSENCEPHALY 1 
SYNDROME; MDLS 

PL ATELET- AU llVAlUNLrr Av iuk. I 
ACETYLHYDROLASE, GAMMA SUBUNIT 




MGI 


GDB:9954550 


MYASTHENIA GRAVIS, FAMILIAL 1 
INFANTILE; FIMG 


20 


MHS2 


rGDB:132580 


MALIGNANT HYPERTHERMIA 1 
SUSCEPTIBILITY-2; MHS2 J 




1 MKS1 


1 GDB:681967 


MECKEL SYNDROME; MKS J 




| MPO 


| GDB:120192 


MYELOPEROXIDASE DEFICIENCY j 




1 MUL 


| GDB:636050 


MULIBREY NANISM; MUL _| 


25 


1 MY015A 


1 GDB:9838006 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 3; DFNB3 




| NAGLU 


1 GDB:636533 


MUCOPOLYSACCHARIDOSIS TYPE UIB I 




1 XT A TiT> 

1 NAPB 


l KjDD.yyo^o / 


NEURITIS WITH BRACHIAL i 
PREDILECTION; NAPB J 


30 


Infi 


| GDB: 120231 


NEUROFIBROMATOSIS, TYPE I; NF1 j 




Inmei 


1 GDB: 127965 


NON-METASTATIC CELLS 1 , PROTEIN 1 
EXPRESSED IN; NME1 




1 P4HB 


1 GDB: 120708 


PROLYL-4-HYDROXYLASE, BETA 1 
POLYPEPTIDE; PHDB; PROHB J 



35 



-111- 
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Gene 


GDB Accession ID 


OMIM Link 


5 


PAFAH1B1 


GDB.677430 


MILLER-DIEKER LISSENCEPHALY ! 
SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNIT 




PECAM1 


GDB:696372 


PLATELET-ENDOTHELIAL CELL 
ADHESION MOLECULE; PECAM1 




PEX12 


GDB:6155804 


ZELLWEGER SYNDROME; ZS 
PEROXIN-12; PEX12 


10 


PHB 


GDB: 126600 


PROHIBITIN; PHB 




PMP22 


GDB:134190 


CHARCOT-MAREE-TOOTH DISEASE, 
1 Yrrs 1A; LM1 1 A ri Y r tvtv 1 isAJr rllA^ 
NEUROPATHY OF DEJERINE-SOTTAS 
PERIPHERAL MYELIN PROTEIN 22; 
PMP22 


15 


PRKAR1A 


GDB: 1203 13 


MYXOMA, SPOTTY PIGMENTATION, 
AND ENDOCRINE OVERACTIVITY 
PROTEIN KIN ASE, cAMP-DEPENDENT, 
REGULATORY, TYPE I, ALPHA; 
PRKAR1A 




PRKCA 


GDB:128015 


PROTEIN KINASE C, ALPHA; PRKCA 


20 


PRKWNK4 


GDB:9954566 


PSEUDOHYPOALDOSTERONISM TYPE II, 
LOCUS B; PHA2B 




PRP8 


GDB:9957697 


RETINITIS PIGMENTOSA-13; RP13 




PRPF8 


GDB:392647 


RETINITIS PIGMENTOSA-13; RP13 




PTLAH 


GDB:9957342 


APLASIA OR HYPOPLASIA 


25 


RARA 


GDB: 120337 


RETINOIC ACID RECEPTOR, ALPHA; 
RARA 




RCV1 


GDB: 135477 


RECOVERIN; RCV1 


30 


RMSA1 


GDB:304519 


REGULATOR OF MITOTIC SPINDLE 
ASSEMBLY 1; RMSA1 


RP17 


GDB.683199 


RETINITIS PIGMENTOSA-17; RP17 




RSS 


GDB-.439249 


RUSSELL-SILVER SYNDROME; RSS 




SCN4A 


GDB:125181 


PERIODIC PARALYSIS U 


35 


SERPINF2 


GDB: 120301 


PLASMIN INHIBITOR DEFICIENCY 


SGCA 


GDB.384077 


ADHALIN; ADL 
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Gene 



SGSH 



GDB Accession ID 



GDB:1319101 



OMIM Link 



MUCOPOLYSACCHARIDOSIS TYPE IIIA 



SHBG 



SLC2A4 



SLC4A1 



GDB: 125280 



SEX HORMONE BINDING GLOBULIN; 
SHBG 



GDB.l 19997 



SOLUTE CARRIER FAMILY 2, MEMBER 4; 
SLC2A4 ______ 



GDB.l 19874 



SOLUTE CARRIER FAMILY 4, ANION 
EXCHANGER, MEMBER 1; SLC4A1 
BLOOD GROUP-DIEGO SYSTEM; DI 
BLOOD GROUP-WRIGHT ANTIGEN; Wr 
ELLIPTOCYTOSIS, RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC 



SLC6A4 



SMCR 



SOST 



S0X9 



SSTR2 
SYM1 



SYNS1 



TCF2 



GDB:134713 



SOLUTE CARRIER FAMILY 6, MEMBER 4; 
SLC6A4 



GDB:120379 



SMITH-MAGENIS SYNDROME; SMS 



GDB: 10450629 



SCLEROSTEOSIS 



GDB: 134730 



DYSPLASIA 



GDB:134186 



SOMATOSTATIN RECEPTOR-2; SSTR2 



GDB:5 12174 



SYMPHALANGISM, PROXIMAL; SYM1 



GDB:9862343 



SYNOSTOSES, MULTIPLE, WITH 
BRACHYDACTYLY 



GDB:125298 



| TRANSCRIPTION FACTOR-2, HEPATIC; 
TCF2 



THRA 



GDB: 120730 



THYROID HORMONE RECEPTOR, ALPHA 
1; THRA 



TIMP2 



TOC 



GDB: 132612 



TISSUE INHIBITOR OF 
METALLOPROTEINASE-2; TIMP2 



GDB:451978 



TYLOSIS WITH ESOPHAGEAL CANCER; 
TOC 



TOP2A 



GDB: 11 8884 



TOPOISOMERASE (DNA) II, ALPHA; 
TOP2A 



TP53 



VBCH 



GDB: 120445 



CANCER, HEPATOCELLULAR 
LI-FRAUMENI SYNDROME; LFS TUMOR 
I PROTEIN p53; TP53 CARCINOMA 



GDB:9954554 



HYPEROSTOSIS CORTICAL1S 
GENERALISATA 
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Table 19: 



Genes, Locations and Genetic Disorders on Chromosome 18 



Gene 


GDB Accession ID 


OMIM Link 


ATP8B1 


GDB:453352 


CHOLESTASIS, PROGRESSIVE FAMILIAL 
INTRAHEPATIC 1; PFIC1 INTRAHEPATIC 
CHOLESTASIS FAMILIAL INTRAHEPATIC 
CHOLESTASIS- 1; FIC1 


BCL2 


GDB:1 19031 


B-CELL CLL/LYMPHOMA 2; BCL2 


CNSN 


GDB.9954580 


CARNOSINEMIA 


CORD1 


GDB: 118773 


CONE-ROD DYSTROPHY- 1 ; CORD1 


CYB5 


GDB.T25236 


METHEMOGLOBINEMIA DUE TO 
DEFICIENCY OF CYTOCHROME b5 


DCC 


GDB.T 19838 


DELETED IN COLORECTAL CARCINOMA; 
DCC 


F5F8D 


GDB:6919858 


FACTOR V AND FACTOR VIII, COMBINED 
DEFICIENCY OF; F5F8D 


FBCH 


GDB: 127282 


PROTOPORPHYRIA, ERYTHROPOIETIC 


FEO 


GDB:4378120 


POLYOSTOTIC OSTEOLYTIC DYSPLASIA, 
HEREDITARY EXPANSILE; HEPOD 


LAMA3 


GDB:251818 


LAMININ, ALPHA 3; LAMA3 


LCFS2 


GDB:9954578 


CANCER 


MADH4 


GDB:4642788 


POLYPOSIS, JUVENILE INTESTINAL 
MOTHERS AGAINST DECAPENTAPLEGIC, 
DROSOPHILA, HOMOLOG OF, 4; MADH4 


MAFD1 


GDB: 1201 63 


MANIC-DEPRESSrVE PSYCHOSIS, 
AUTOSOMAL 


MC2R 


GDB:135163 


ADRENAL UNRESPONSIVENESS TO ACTH 


MCL 


GDB:9954574 


LEIOMYOMATA, HEREDITARY MULTIPLE, 
OF SKIN 


MYP2 


GDB:9862232 


MYOPIA 


NPC1 


GDB:138178 


NIEMANN-PICK DISEASE, TYPE CI ; NPC1 


SPPK 


GDB .606444 


PALMOPLANTARIS STRIATA 


TGFBRE 


GDB:250852 


TRANSFORMING GROWTH FACTOR, BETA 
1 RESPONSE ELEMENT 


TGIF 


GDB:9787150 


HOLOPROSENCEPHALY, TYPE 4; HPE4 


TTR 


GDB: 119471 


TRANSTHYRETIN; TTR 



10 
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35 



-114 



NY2- 1321355.1 



Table 20: Genes, Locations and Genetic Disorders on Chromosome 1 9 



Gene JJ 


3DB Accession ID | < 


OMIM Link _l 


AD2 1 < 


3DB.118748 Jj 


ALZHEIMER DISEASE-2;AD2 


AMH 1 < 


GDB: 118996 


PERSISTENT MULLERIAN DUCT 
SYNDROME, TYPES I AMD 11, rMUa j 
ANTI-MULLERIAN HORMONE; AMH 1 


APOC2 1 


GDB: 119689 | 


APOLIPOPROTEIN C-II DEFICIENCY, 
TYPE I HYPERLIPOPROTEINEMIA DUE 
TO J 


APOE 1 


GDB: 119691 [ 


APOLIPOPROTEIN E; APOE f 


ATHS j 


GDB: 128803 1 


LIPOPROTEIN PHENOTYPE; ALP 


BAX 1 


GDB:228082 


BCL2-ASSOCIATED X PROTEIN; BAX\ 1 


BCKDHA J 


GDB: 119723 1 


MAPLE SYRUP URINE DISEASE 1 


BCL3 j 


GDB:120561 1 


B-CELL LEUKEMIA/LYMPHOMA-3; 
BCL3 _j 


BFIC | 




RFNIGN FAMILIAL INFANTILE 
CONVULSIONS 


C3 


GDB: 119044 


COMPLEMENT COMPONENT-3; C3 _j 


CACNA1A 


GDB: 126432 


ATAXIA, PERIODIC 

VESTIBULOCEREBELLAR HEMIPLEGIC 

MIGRAINE, FAMILIAL; MHP 
SPINOCEREBELLAR ATAXIA 6; SCA6 

1 OAT /^TT TTV/f r^Vf A "NTWPT 1 

VOLTAGE-DEPENDENT, P/Q TYPE, 
ALPHA 1A SUBUNIT; CACNA1 A 


CCO 


| GDB: 119755 


j CENTRAL CORE DISEASE OF MUSCLE 1 


CEACAM5 


| GDB: 119054 


1 CARCINOEMBRYONIC ANTIGEN; CEA 1 


POMP 

) 


GDB:344263 


EPIPHYSEAL DYSPLASIA, MULTIPLE; 1 
MED PSEUDOACHONDROPLASTIC 
DYSPLASIA CARTILAGE OLIGOMERIC 
1 MATRIX PROTEIN; COMP _J 


CRX 


1 GDB.333932 


CONE-ROD DYSTROPHY-2; CORD2 
AMAUROSIS CONGENITA OF LEBER I 
CONE-ROD HOMEO BOX-CONTAINING 
GENE 


5 DBA 


1 GDB.9600353 


ANEMIA, CONGENITAL HYPOPLASTIC, 1 
1 OF BLACKFAN AND DIAMOND j 



10 



15 



20 



25 
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| Gene M 


GDB Accession ID | < 


DMJM Link 


1 DDU I < 


GDB: 10796026 1 1 


URTICARIA; DDU 


1 DFNA4 1 


GDB:606540 J 


DEAFNESS, AUTOSOMAL DOMINANT j 
NONSYNDROM1U bllfSovJlUiNiiuxs^vL', -t, i 

DFNA4 S 


1 DLL3 1 


GDB:9959026 


VERTEBRAL ANOMALIES j 


| DMPK j 


GDB: 119097 1 


DYSTROPHIA MYUl ujnu^a., l/lvl j 


1 DMWD 1 


GDB:7178354 1 


DYSTROPHIA MYUl UN li^A, l»ivi j 


1 DPD1 1 


GDB: 107961 70 [ 


ENGELMANN DISEASE 


Ieiis I 


GDB: 119101 1 


ECHO 1 1 SENSITIVITY; El 1 S 


] ELA2 | 


GDB: 11 8792 


ELASTASE-2; ELA2 NEUTROPENIA, 
CYCLIC 


1 EPOR _| 


GDB: 125242 1 


ERYTHROPOIETIN RECEPTOR; EPOR 


I ERCC2 1 

j 


GDB:119112 1 


EXCISION-REPAIR COMPLEMENTING | 
nPFFPTTVF TN CHINESE HAMSTER, 2; 1 
ERCC2 XERODERMA PIGMENTOSUM 
1 IV; XP4 j 


ETFB 


IgDB:1 19887 


1 v~»t ¥-^TT»rYkvT TD ATvTCT7ThT? T7T AVOPR OTFJN 1 
ELECTRON 1 KAN o r liK r u/x v urivu 1 oiin , i 

j BETA POLYPEPTIDE; ETFB 


) EXT3 


| GDB:383780 


j EXOSTOSES, MULTIPLE, TYPE III; EXT3 1 


1 EYCL1 


|GDB:1 19269 


| EYE COLOR-1 ; EYCL1 | 


FTL 


|GDB:1 19234 


1 r>r»nnTTTKT T T^TJT' i^U ATM- T7TT I 

1 FERRITIN LlOri 1 CrlAUN, r 1 L, J 


; iFUTl 


j GDB:120618 


J FUCOS YLT KAN br fcKAoii- i,ruii j 


1 FUT2 


| GDB:120619 


FUCOSYL 1 KANorl^KAoii-z, r u i z 


1 FUT6 


| GDB:135180 


1 FUCOSYLTRANSFERASE-6; FUT6 1 


1 GAMT 


I GDB: 13 13736 


GUANIDINOACETATE I 
METHYLTRANSFbRAbiJ, liAM l j 


J | GCDH 


I LnJ£>. 1 JOUUH 


| GLUTARICACIDEMIA I ] 


1 GPI 


| GDB:120015 


GLUCOSEPHOSPHATE ISOMERASE; GPI j 


1 GUSM 


1 GDB:1 19291 


GLUCURONIDASE, MOUSE, MODIFIER 1 
OF; GUSM 1 


1 HB1 


| GDB:9954586 


| BUNDLE BRANCH BLOCK 


5 

| HCL1 


| GDB:1 19304 


1 HAIR COLOR-1; HCL1 | 
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Gene 



GDB Accession ID OMIM Link 



HHC2 



GDB:249836 



HYPOCALCIURIC HYPERCALCEMIA, 
FAMILIAL, TYPE II; HHC2 



HHC3 



GDB:9955121 



HYPOCALCIURIC HYPERCALCEMIA, 
FAMILIAL, TYPE IE; HHC3 ' 



ICAM3 



GDB: 136236 



INTERCELLULAR ADHESION 
MOLECULE-3; ICAM3 



INSR 



GDB:1 19352 



INSULIN RECEPTOR; INSR 



10 I JAK3 



GDB.376460 



JANUS KINASE 3 JAK3 



KLK3 



GDB: 119695 



ANTIGEN, PROSTATE-SPECIFIC; APS 



15 



LDLR 



LHB 



GDB: 1 19362 



HYPERCHOLESTEROLEMIA, FAMILIAL; 
FHC 



GDB:1 19364 



j LUTEINIZING HORMONE, BETA 
POLYPEPTIDE; LHB 



LIG1 



LOH19CR1 



GDB:127274 



LIGASE I, DNA, ATP-DEPENDENT; LIG1 



GDB:9837482 



ANEMIA, CONGENITAL HYPOPLASTIC, 
OF BLACKFAN AND DIAMOND 



LYL1 



GDB:120158 



LEUKEMIA, LYMPHOID, 1; LYL1 



20 



MAN2B1 



GDB: 119376 



MANNOSIDOSIS, ALPHA B, 
LYSOSOMAL 



MCOLN1 



GDB: 10013974 



MUCOLIPIDOSIS W 



25 



MDRV 



GDB:6306714 



MUSCULAR DYSTROPHY, AUTOSOMAL 
DOMINANT, WITH RIMMED VACUOLES; 
MDRV 



MLLT1 



GDB: 136791 



MYELOID/LYMPHOID OR MIXED 
LINEAGE LEUKEMIA, TRANSLOCATED 
TO, 1; MLLT1 



NOTCH3 



30 



GDB:361163 



DEMENTIA, HEREDITARY 
MULTI-INF ARCT TYPE NOTCH, 
DROSOPHILA, HOMOLOG OF, 3; 
NOTCH3 



NPHS1 



GDB:342105 



NEPHROSIS I, CONGENITAL, FINNISH 
TYPE;NPHS1 __ 



OFC3 



35 OPA3 



GDB: 128060 



OROFACIAL CLEFT-3; OFC3 



GDB:9954590 



OPTIC ATROPHY, INFANTILE, WITH 
CHOREA AND SPASTIC PARAPLEGIA 



- 117- 



NY2- 1321355.1 





Gene 


GDB Accession ID 


OMIM Link 




PEPD 


GDB.120273 


PEPTIDASE D; PEPD 




PRPF31 


GDB.-333911 


RETINITIS PIGMENTOSA 1 1 ; RP 1 1 




PRTN3 


GDB:126876 


PROTEINASE 3; PRTN3; PR3 j 


5 


PRX 


GDB.l 1501256 


DEJERINE-SOTTAS 




PSG1 


GDB:120321 


PREGNANCY-SPECIFIC 

BETrA-1 -OL Y UU.r rsAJ 1 HUN 1 , r ovj i 


10 


PVR 


GDB: 120324 


POLIOVIRUS SUSCEPTIBILITY, OR 

riT~ , XTC'T'T , T\ /TT'V. D\/"C 

SENSinVll Y; rvb 




RYR1 


GDB: 120359 


HYPERTHERMIA OF ANESTHESIA 
RYANODINE RECEPTOR- 1 ; RYR1 j 




SLC5A5 


GDB.5892184 


aULU I ii ^AKxsitliN- r/VlviiL/ 1 1v1.01v.1_DU/rv- 
5; SLC5A5 


15 


SLC7A9 


GDB-.9958852 


CYoIITnUKIA, 1 I JrJti ill, ^oinuj 




STK11 


GDB:9732383 


PEUTZ-JEGHERS SYNDROME 
SERINE/THREONINE PROTEIN KINASE 
11; blKl I 


20 


TBXA2R 


GDB: 1275 17 


tt tt> r\K at* rw A xtt? AO P tPPJPTTYR 
THKUJVLdUXAJNe. /VZ JsJc^rlr i wx\^ 

PLATELET; TBXA2R 




TGFB1 


GDB: 120729 


ENGELMANN DISEASE 
TRANSFORMING uRUW lri Jf AC l UK, 
BETA-1; TGFB1 




TNNI3 


GDB: 125309 


TROPONIN I, CARDIAC; TNNI3 


25 


TYROBP 


GDB:9954457 


POLYCYSTIC LIPOMEMBRANOUS 
OSTEODYSPLASIA WITH SCLEROSING 
LEUKOENCEPHALOPATHY 




Table 21- C 


jenes, Locations and Genetic Disorders on Chromosome 20 


30 


Gene 


GDB Accession ID 


OMIM Link 


ADA 


GDB-.l 19649 


ADENOSINE DEAMINASE; ADA 




AHCY 


GDB:118983 


S-ADENOSYLHOMOCYSTEINE 
HYDROLASE; AHCY 


35 


AVP 


GDB:1 19009 


DIABETES INSIPIDUS, 
NEUROHYPOPHYSEAL TYPE ARGININE 
VASOPRESSIN; AVP 
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Gene 


CrOB Accession ID 


OMIM Link 




CDAN2 


GDB.-9823270 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE II 


5 


CDMP1 


GDB:438940 


CHONDRODYSPLASIA, GREBE TYPE 
CARTILAGE-DERIVED MORPHOGENETIC 
PROTEIN 1 

X lvV/ X X_-*XX > X 




CHED1 


GDB.-3837719 


CORNEAL DYSTROPHY, CONGENITAL 
ENDOTHELIAL- CHED 


10 


CHRNA4 


GDB:128169 


CHOLINERGIC RECEPTOR, NEURONAL 
NICOTINIC ALPHA POLYPEPTIDE 4" 

X 'I XV/ W X XX ^1 IV* y i*V/l X XX*. A W A-/ X X J_/X X XA-' *—i ~ 5 

CHRNA4 EPILEPSY, BENIGN NEONATAL; 

T7TJTVT 1 i 




CST3 


GDB:1 19817 


AMYLOIDOSIS VI 


15 


EDN3 


GDB- 119862 


ENDOTHELIN-3 • EDN3 

A** A ^ X-^ X X 1 i /,■ JM-X ^ y A mM^ X ^ 

WAARDENBURG-SHAH SYNDROME 


EEGV1 


GDB: 127525 


ELECTROENCEPHALOGRAM, 
LOW-VOLTAGE 




FTLL1 


GDB: 119235 


FERRITIN LIGHT CHAIN; FTL 


20 


GNAS 


GDB: 120628 


GUANINE NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-STIMULATING 
POLYPEPTIDE; 


25 


GSS 


GDB:637022 


GLUTATHIONE SYNTHETASE 
DEFICIENCY OF ERYTHROCYTES, 
HEMOLYTIC ANEMIA 
PYROGLUTAMICACIDURIA 
HNF4AGDB:393281DIABETES MELLITUS, 
AUTOSOMAL DOMINANT 
TRANSCRIPTION FACTOR 14, HEPATIC 
NUCLEAR FACTOR: TCF14 

X^l W V/JjJ_/iHV X nv X V/AVj x v^x x ~ 




JAG1 


GDB6 175920 


CHOLESTASIS WITH PERIPHERAL 

v^»x A w I »i / w x xVvjxu yr x xxx X JL/ivii xii <ivr>i-( 

PULMONARY STENOSIS JAGGED 1; JAG1 


30 


KCN02 


GDB-9787229 


EPILEPSY BENIGN NEONATAL- EBN1 
POTASSIUM CHANNEL, 
VOLTAGE-GATED, SUBFAMILY Q, 
MEMBER 2 




MKKS 


GDB:9860197 


HYDROMETROCOLPOS SYNDROME 




NBIA1 


GDB:4252819 


HALLERVORDEN-SPATZ DISEASE 


35 


PCK1 


GDB.125349 


PHOSPHOENOLPYRUVATE 
CARBOXYK1NASE 1, SOLUBLE; PCK1 
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10 



Gene 


GDB Accession IP 1 


OMlMLink 




GPB:203940 


PROTEINASE INHIBITOR 3; PI3 


PI3 

PPGB 




NEURAMINIDASE DEFICIENCY WITH 
BETA-GALACTOSID ASE DEFICIENCY 


PPMD 


GDB:702144 


CORNEAL DYSTROPHY, HEREDITARY 
POLYMORPHOUS POSTERIOR PPCD 


PRNP 


GDB:120720 


GERSTMANN-STRAUSSLER DISEASE; 
GSD PRION PROTEIN; PRNP 


THBD 


GDB.l 19613 


THROMBOMODULIN; THBD 


TOPI 


GDB:120444 


TOPOISOMERASE (DNA) I; TOPI 



15 



20 



25 



Gene || 


GDB Accession IP 


OMIM Link 1 


AIRE I 


GDB:567198 


AUTOIMMUNE _ T _ T 
POLYENDOCRINOPATHY-CANDIDIA 

SIS-ECTODERMAL DYSTROPHY; 
APECED 


APP 1 


GDB: 119692 j 


ALZHEIMER DISEASE; AD AMYLOID 
BETA A4 PRECURSOR PROTEIN; APP 


CBS 1 


GDB:1 19754 | 


HOMOCYSTINURIA J 


1 COL6A1 J 


GDB: 119065 


COLLAGEN, TYPE VI, ALPHA- 1 
CHAIN; COL6A1 MYOPATHY, 
BENIGN CONGENITAL, WITH 
CONTRACTURES 


|cOL6A2 


GDB:1 19793 


COLLAGEN, TYPE VI, ALPHA.-2 
| CHAIN; COL6A2 MYOPATHY, 

BENIGN CONGENITAL, WITH 
[ CONTRACTURES 


CSTB 


1 GDB:5215249 


MYOCLONUS EPILEPSY OF 
UNVERRICHT AND LUNDBORG 
j CYSTATIN B; CSTB 


Idcr 


GDB: 125354 


1 TRISOMY 21 


[ DSCR1 


GDB:731000 


1 TRISOMY 21 


1 FPDMM 


GDB:9954610 


CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNIT 2; CBFA2 
PLATELET DISORDER FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 



-120 



NY2- 132)355.1 





Gene 


GDB Accession ID 


Uiviiivi juiot*. l 




HLCS 


GDB:392648 


\ax tt TTPT F CARBOXYLASE 1 
DEFICIENCY, BIOTIN-RESPONSIVE; 
MCD 1 


5 


HPE1 


GDB: 136065 


TjrvT rvPwn«3F>JCFPHALY FAMILIAL 1 
ALOBAR j 




ITGB2 


GDB:120574 


INTEGRIN BETA-2; ITGB2 




KCNE1 


GDB.127909 


POTASSIUM VOLTAGE-GATED 1 
CHANNEL, ISK-RELATED 
SUBFAMILY, MEMBER 1 ; | 


10 


KNO 


GDB:4073044 


KNOBLOCH SYNDROME; KNO j 




PRSS7 


GDB:384083 


ENTEROKINASE DEFICIENCY 


1 J 


RUNX1 


GDB: 1283 13 


CORE-BINDING FACTOR, RUNT \ 
DOMAIN, ALPHA SUBUNTT 2; CBFA2 
PLATELET DISORDER, FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 1 




SOD1 


GDB: 119596 


AMYOTROPHIC LATERAL 
SCLEROSIS SUPEROXIDE 
DISMUTASE-1; SOD1 MUSCULAR 
ATROPHY, PROGRESSIVE, WITH 
AMYOTROPHIC LATERAL I 
SCLEROSIS 


20 


TAM 


GDB:9958709 


MYELOPROLIFERATIVE I 
SYNDROME, TRANSIENT _| 



Table 23 : Genes, Locations and Genetic Disorders on Chromosome 22 



25 



Gene 


GDB Accession ID 


OMIM Link 


ADSL 


GDB:1 19655 


ADENYLOSUCCINATE LYASE; ADSL 


ARSA 


GDB:1 19007 


METACHROMATIC 

LEUKODYSTROPHY, LATE-INFANTILE 


BCR 


GDB:120562 


BREAKPOINT CLUSTER REGION; BCR 


CECR 


GDB: 119772 


CAT EYE SYNDROME; CES 


CHEK2 


GDB-.9958730 


LI-FRAUMENI SYNDROME; LFS 
OSTEOGENIC SARCOMA 


COMT 


GDB: 119795 


CATECHOL-O-METHYLTRANSFERASE 
;COMT 



35 
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Vlvuv 


GDB Accession ID 


OMDM Link i 




CRYBB2 


GDB: 119075 


CRYSTALLIN, BETA B2; CRYBB2 
CATARACT, CONGENITAL, 
CERULEAN TYPE, 2; CCA2 


5 


CSF2RR 


GDB '12683 8 


GRANULOCYTE-MACROPHAGE 
COLONY-STIMULATING FACTOR 




CTHM 


GDB:439247 


HEART MALFORMATIONS ; CTHM 


10 


TYP2D6 


GDB-132127 


CYTOCHROME P450, SUBFAMILY HD; 
CYP2D | 




CYP2D@ 


GDB: 119832 


CYTOCHROME P450, SUBFAMILY nD; 










15 


D1A1 


#TTYQ»1 1 QQAft 

LrJJrS.l !Vo4c$ 


VTPTWFTWfnfrT OBTNFMTA DT IE TO 

1YLE* x xxJC/XVXvx \JX-/V_y O X1N J_>lVXX/-\. XV X-f X \-/ 

DEFICIENCY OF METHEMOGLOBIN 
REDUCTASE 




EWSR1 


GDB.135984 


E WING SARCOMA; bwa 




GGT1 


GDB:120623 


GLUTATHIONURIA 


20 


MGCR 


GDB:120180 


MENINGIOMA; MGM 


"K VTXT1 

MN1 




lVxx-»tNllN\JlV^lVl/\., ivlVJlVl 




NAGA 


GDB: 119445 


ALPHA-GALACTOSIDASE B; GALB 






GDB- 120232 


NEUROFIBROMATOSIS, TYPE II; NF2 


25 




GDB -99546 19 


HYPERTELORISM WITH ESOPHAGEAL 
ABNORMALITY AND HYPOSPADIAS 




PDOFR 
ruur d 


GDB- 120709 


V-SIS PLATELET-DERIVED GROWTH 
FACTOR BETA POLYPEPTIDE; PDGFB 




PPARA 


GDB:202877 


PEROXISOME PROLIFERATOR 
ACTIVATED RECEPTOR ALPHA; 
PPARA 


30 


PRODH 


GDB:5215168 


HYPERPROLINEMIA, TYPE I 




SC02 


GDB:9958568 


CYTOCHROME c OXIDASE 
DEFICIENCY 




SCZD4 


GDB: 1387047 


SCHIZOPHRENIA DISORDER-4; SCZD4 


35 


SERPIND1 


GDB: 120038 


HEPARIN COFACTOR H; HCF2 



Gene 


OUB Accession jlu 


CllVJJiVl XjIIIK 


ct r 1 ^ a i 

oJLA^3/\l 


KJUD. 1ZUJ / J 


<sOT TITF CARRIER FAMILY 5 
MEMBER 1;SLC5A1 


SOX10 


GDB.9834028 


SRY-BOX 10; SOX10 


TCN2 


GDB: 119608 


TRANSCOBALAMIN H DEFICIENCY 


TIMP3 


GDB:138175 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-3; TIMP3 


VCF 


GDB.-136422 


VELOCARDIOFACIAL SYNDROME 



Table 24: Genes, Locations and Genetic Disorders on Chromosome X 



25 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


ABCD1 


ODBrl 18991 


ADRENOLrb/U KaJJJ Y o 1 KUr ri Y ; AL,U 


ACTL1 


GDB: 119648 


ACl IN-LlKJi ok(^UJ}NUJb-l 9 AU1JL1 


ADFN 


GDB: 118977 


ALBINISM-DEAFNESS SYNDROME; ADFN; 
ALDS 


AGMX2 


GDB:1 19661 


AGAMMAGLOBULINEMIA, X-LINKED, 


AHDS 


GDB.T25899 


MENTAL RETARDATION, X-LINKED, WITH 
HYPOTONIA 


AIC 


GDB:1 18986 


CORPUS CALLOSUM, AGENESIS OF, WITH 
CHORIORETINAL ABNORMALITY 


AIED 


GDB: 119663 


ALBINISM, OCULAR, TYPE 2; OA2 


AIH3 


GDB: 131443 


AMELOGENESIS IMPERFECTA-3, 
HYPOPLASTIC TYPE; AIH3 


ALAS2 


GDB: 119666 


ANEMIA, HYPOCHROMIC 


AMCD 


GDB:5584286 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA, DISTAL 


AMELX 


GDB: 119675 


AMELOGENESIS IMPERFECTA- 1, 
HYPOPLASTIC TYPE; AIH1 


ANOP1 


GDB: 128454 


CLINICAL; ANOP1 


AR 


GDB: 120556 


ANDROGEN INSENSITWITY SYNDROME; 
AIS ANDROGEN RECEPTOR; AR 


ARAF1 


GDB: 119004 


V-RAF MURINE SARCOMA 361 1 VIRAL 
ONCOGENE HOMOLOG 1; ARAF1 
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Gene 


GDB Accession ID 


OMIMLink 




ARSC2 


GDB: 119702 


ARYLSULFATASE C, t rUKM, AKbC2 


5 


ARSE 


GDB:555743 


CHONDRODYSPLASIA PUNC1 ATA 1, 
X-LINKED RECESSIVE; CDPX1 




ARTS 


GDB:9954651 


FATAL X-LINKED, WITH DEAFNESS AND 
LOSS OF VISION 




ASAT 


GDB:9954649 


SIDEROBLASTIC, AND 

SPINOCEREBELLAR ATAXIA; ASAT j 


10 


ASSP5 


GDB: 119019 


CITRULLINEMIA 




ATP7A 


GDB: 119395 


ATPase, Cu(2+)-TRANSPORTING, ALPHA 

DAT vrt>t?T5TTF\T7« ATP7A NilTHSJl^PS 

SYNDROME 


15 


ATRX 


GDB: 136052 


ALPHA-THALAS SEMIA/MENTAL 
RETARDATION SYNDROME, X-L.UNKJ}L>, 
ATRX ALPHA-THALAS SEMI A/MENTAL 
RETARDATION SYNDROME, 
NONDELETION TYPE 




AVPR2 


GDB:131475 


DIABETES INSIPIDUS, NEPHROGENIC 




BFLS 


GDB: 120566 


BORJESON SYNDROME; BORJ 


20 


BGN 


GDB:1 19727 


B1GLYCAN; BGN 




BTK 


GDB:120542 


BRUTON AGAMMAGLOBULINEMIA 
TYROSINE KINASE; BTK 




BZX 


GDB: 52059 12 




25 


C1HR 


GDB: ll 9040 


TATA DAY orKTT^TXTri PT? OTFTTsI 

(TBP)-ASSOCIATED FACTOR 2A; TAF2A 




CACNA1F 


GDB:6053864 


NIGHTBLINDNESS, CONGENITAL 
blAilUNAKY, A-J-llNIsJllJ, I irJi ^, >->in oz 
CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, ALPHA IF 
SUBUNIT; CACNA1F 


30 


CALB3 


GDB: 133780 


CALBINDIN 3; CALB3 






GDB :995 8963 


COLORBLINDNESS, 

BLUE-MONO-CONE-MONOCHROMATIC 
TYPE; CBBM 


35 


CCT 


GDB: 119756 


CATARACT, CONGENITAL TOTAL, WITH 
POSTERIOR SUTURAL OPACITIES IN 
HETEROZYGOTES; 
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Gene 


(jDi> Accession JLU 


AMTM Y Snlr 
dlVlliVl LUIK 




CDR1 


GDB:1 19053 


CEREBELLAR DEGENERATION-RELATED 


5 


CFNS 


GDB:9579470 


CRANIOFRONTONASAL SYNDROME; 




CGF1 


(jD1J:o2/5oo/ 


r\ OXTTTT fYKJ 
vAJCflNl 1 lvJXN 




CHM 


GDB.120400 


CHOROIDEREMIA; CHM 


10 


CHR39C 


GDB: 119779 


CHOLESTEROL REPRESSIBLE PROTEIN 
39C; CHR39C 




CIDX ; 


GDB:127736 


SEVERE COMBINED IMMUNODEFICIENCY 
DISEASE, X-LINKED, 2; SCIDX2 




CLA2 


GDB: 119782 


CEREBELLAR ATAXIA, X-LINKED; CLA2 


15 


CLCN5 


GDB:270667 


CHLORIDE CHANNEL 5; CLCN5 FANCONI 
SYNDROME, RENAL, WITH 
NEPHROCALCINOSIS AND RENAL 
STONES NEPHROLITHIASIS, X-LINKED 
RECESSIVE, WITH RENAL FAILURE; XRN 


20 




GDB- 119784 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POLYPEPTIDE 3; RPS6KA3 

LUr r UN -JLA-J W tv I ox IN u ivV-J iviJC/ , s^i^o 


CM 1X2 


Kjud: i Zoo 1 1 


X-LINKED RECESSIVE, 2; CMTX2 




CMTX3 


GDB: 128151 


CHARCOT-MARIE-TOOTH NEUROPATHY, 
X-LINKED RECESSIVE, 3; CMTX3 


25 


CND 


GDB-.9954627 


DERMOIDS OF CORNEA; CND 


COD1 


GDB: 11 9787 


CONE DYSTROPHY, X-LINKED, 1 ; COD1 




COD2 


GDB:6520166 


CONE DYSTROPHY, X-LINKED, 2; COD2 


30 


COL4A5 


GDB: 120596 


COLLAGEN, TYPE TV, ALPHA-5 CHAIN; 
COL4A5 LEIOMYOMATOSIS, 
ESOPHAGEAL AND VULVAL, WITH 
NEPHROPATHY 




COL4A6 


GDB:222775 


COLLAGEN, TYPE TV, ALPHA-6 CHAIN; 
COL4A6 LEIOMYOMATOSIS, 
ESOPHAGEAL AND VULVAL, WITH 
NEPHROPATHY 


•3C 


CPX 


GDB: 120598 


CLEFT PALATE, X-LINKED; CPX 
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Gene 


GDB Accession ID 


OMEVf Link 




CVD1 


GDB.9954659 


CARDIAC VALVULAR DYSPLASIA, 
X-LINKED 


5 


CYBB 


GDB:120513 


GRANULOMATOUS DISEASE, CHRONIC; 
CGD 




DCX 


GDB:9823272 


LISSENCEPHALY, X-LINKED 




DFN2 


GDB: 119091 


DEAFNESS, X-LINKED 2, PERCEPTIVE 
CONGENITAL; DFN2 


10 


DFN4 


GDB:433255 


DEArNcSb, A-LllNlvtlL> 4, COJMvjillNl 1 AL 
SENSORINEURAL; DFN4 




DFN6 


GDB: 1320698 


DEAFNESS, X-LINKED, 6, PROGRESSIVE; 
DbNo 




DHOF 


GDB: ll 9847 


*Er\f* A T r\DD A /f A T UVDnDT ACTA- rYU/TP 


15 


DIAPH2 


GDB:9835484 


DIAPHANOUS, DROSOPHILA, HOMOLOG 
CONGENITA; DKC 




DMD 


GDB: 119850 


MUoOUJLAK JJYblKUrrlY, 
PSEUDOHYPERTROPHIC PROGRESSIVE, 
DUCHENNE AND BECKER 


20 


DSS 


GDB:433750 


r\nc a OTZ? OT7TvTCTT*T\7'C CCY CAT - HQQ 

DUoAGrS-oJblNolllVii oliX KJlVJDKoALr, JJoo 




DYT3 


GDB: 11 8789 


TORSION DYSTONIA-3, X-LINKED TYPE; 
DYT3 




EBM 


GDB:1 19102 


BULLOUS DYSTROPHY, HEREDITARY 
MACULAR lYPb 


25 


EBP 


GDB:125212 


CHONDRODYSPLASIA PUNCTATA, 
X-LINKED DOMINANT; CDPX2; CDPXD; 
CPXD 




EDI 


GDB:1 19859 


ECTODERMAL DYSPLASIA, ANHIDROTIC; 
EDA 


30 


ELK1 


GDB: 119867 


tiLKl, MJiMrsriK Ur Jala UJNCVJvjiiJNil 
FAMILY; ELK1 




EMD 


GDB: 119108 


MUSCULAR DYSTROPHY, TARDIVE, 
DREIFUSS-EMERY TYPE, WITH 
CONTRACTURES 


35 


EVR2 


GDB: 136068 


EXUDATIVE VITREORETINOPATHY, 
FAMILIAL, X-LINKED RECESSIVE; EVR2 


F8C 


GDB: 119124 


HEMOPHILIA A 
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Gene 


GDB Accession ID 


OMIM Link 




F9 


GDB: 119900 


HEMOPHILIA B; HEMB 


5 


FCP1 


GDB:347490 


r-^HL.L, JrKLUJUCllUIN, X-L1NKED; FCPX 




FDPSL5 


GDB: 119922 


SYNTHETASE-5; FPSL5 




FGD1 


GDB:1 19131 


SYNDROME FACIOGENTTAL DYSPLASIA; 
FGDY 




FGS1 


GDB:9836950 


FG SYNDROME 


10 


FMR1 


GDB: 129038 


FRAGILE SITE MENTAT RFTAPFIattotsj i • 
FMR1 




FMR2 


GDB: 141 566 


FRAGILE SITE, FOLIC ACID TYPE, RARE, 
FRA(X)(q28); FRAXE 




G6PD 


GDB: 120621 


GLUCOSE-6-PHOSPHATE 
DEHYDROGENASE; G6PD 


15 


GABRA3 


GDB: 119968 


GAMMA- AMIN OBUTYRIC ACID 
RECEPTOR, ALPHA-3; GABRA3 




GATA1 


GDB: 125373 


GATA-BINDING PROTEIN 1; GATA1 


20 


GDI1 


GDB: 1347097 


GDP DISSOCIATION INHIBITOR 1; GDI1 

mental Retardation x-LiMKFn 

NONSPECIFIC, TYPE 3; MRX3 




GDXY 


GDB:9954629 


DYSGENESIS, XY FEMALE TYPE; GDXY 


25 


GJB1 


GDB: 125246 


CHARCOT-MARIE-TOOTH PFROMPAT 
MUSCULAR ATROPHY, X-LINKED; CMTC1 
GAP JUNCTION PROTEIN, BETA-1, 32 KD: 

vJJO 1 




GK 


GDB*1 19271 


ri i jrliKvjJL Y CcKULjhMI A 




GLA 


GDB: 11 9272 


ANGIOKERATOMA, DIFFUSE 




GPC3 


GDB.-3770726 


GLYPICAN-3; GPC3 SIMPSON 
DYSMORPHIA SYNDROME- SDYS 


30 


GRPR 


GDB: 128035 


GASTRIN-RELEASING PEPTIDE 
RECEPTOR; GRPR 




GTD 


GDB:9954635 


GONADOTROPIN DEFICIENCY; GTD i 




GUST 


GDB:9954655 


MENTAL RETARDATION WITH OPTIC 
ATROPHY, DEAFNESS, AND SEIZURES 


35 


HMS1 


GDB:251827 


1;HMS1 
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HPRT1 


GDB:1 19317 


HYPOXANTHEME GUANINE 
PHOSPHORIBOSYLTRANSFERASE 1; 
HPRT1 


5 


HPT 


GDB: 119322 


HYPOPARATHYROIDISM, X-LINKED; 
HYPX 




HTC2 


GDB:700980 


HYPERTRICHOSIS, CONGENITAL 
GENERALIZED; CGH; HCG 


10 


HTR2C 


GDB:378202 


5-@HYDROXYTRYPTAMINE RECEPTOR 
2C; HTR2C 




IIYR 


GDB:9954625 


REGULATOR; HYR 




IDS 


GDB:120521 


MUCOPOLYSACCHARIDOSIS TYPE II 




IHG1 


GDB:1 19343 


HYPOPLASIA OF, WITH GLAUCOMA; IHG 


15 


IL2RG 


GDB: 134807 


INTERLEUKIN-2 RECEPTOR, GAMMA; 
IL2RG SEVERE COMBINED 
IMMUNODEFICIENCY DISEASE, 
X-LINKED, 2; SCIDX2 




INDX 


GDB:9954657 


IMMUNONEUROLO GIC DISORDER, 
X-LINKED 


20 


IP1 


GDB:120105 


INCONTINENTIA PIGMENTI, TYPE I; IP1 




IP2 


GDB: 120106 


INCONTINENTIA PIGMENTI, TYPE II; IP2 




JMS 


GDB:204055 


MENTAL RETARDATION, X-LINKED, WITH 
GROWTH RETARDATION, DEAFNESS, 
AND 


25 


KALI 


GDB: 1201 16 


KALLMANN SYNDROME 1; KALI 




KFSD 


GDB: 128 174 


KERATOSIS FOLLICULARIS SPINULOSA 
DECALVANS CUM OPHIASI; KFSD 


30 


L1CAM 


GDB:120133 


CLASPED THUMB AND MENTAL 
RETARDATION LI CELL ADHESION 
MOLECULE; L1CAM 


LAMP2 


GDB: 125376 


LYSOSOME-ASSOCIATED MEMBRANE 
PROTEIN B; LAMP2; LAMPB 




MAA 


GDB: 119372 


MICROPHTHALMIA OR ANOPHTHALMOS, 
WITH ASSOCIATED ANOMALIES; MAA 




MAFD2 


GDB: 119373 


PSYCHOSIS, X-LINKED 


35 


MAOA 


GDB: 1201 64 


MONOAMINE OXIDASE A; MAO A 
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MAOB 


GDB: 119377 


MONOAMINE OXIDASE B; MAOB 


MCF2 


GDB:120168 


MCF.2 CELL LINE DERIVED 
TRANSFORMING SEQUENCE; MCF2 


MCS 


GDB:128370 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-4, WITH CONGENITAL 
CONTRACTURES 


ME AX 


GDB:1 19383 


X-LINKED, WITH EXCESSIVE 
AUTOPHAGY- XMEA- MEAX 


MECP2 


GDB:3851454 ! 


SYNDROME; RTT 


MF4 


GDB: 119386 


METACARPAL 4-5 FUSION; MF4 


MGC1 


GDB:120179 


MEGALOCORNEA; MGC1; MGCN 


MIC5 


GDB:120526 


SURFACE ANTIGEN, X-LINKED; SAX 


MIDI 


GDB:9772232 


OPITZ SYNDROME 


MLLT7 


GDB:392309 


MYELOID/LYMPHOID OR MIXED-LINEAGE 
LEUKEMIA, TRANSLOCATED TO, 7; 
MLLT7 


MLS 


GDB:262123 


MICROPHTHALMIA WITH LINEAR SKIN 
DEFECTS; MLS 


MRSD 


GDB: 119398 


MENTAL RETARDATION, SKELETAL 
DYSPLASIA, AND ABDUCENS PALSY; 
MRSD 


MRX14 


GDB: 138453 


RETARDATION, X-LINKED 14; MRX14 


MRX1 


GDB: 120 193 


MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 1; MRX1 


MRX20 


GDB:2 17050 


MENTAL RETARDATION, X-LINKED 20; 
MRX20 


MRX2 


GDB: 120194 


RETARDATION, X-LINKED NONSPECIFIC, 
TYPE 2; MRX2 


MRX3 


GDB.128105 


GDP DISSOCIATION INHIBITOR 1; GDI1 
MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 3; MRX3 


MRX40 


GDB:700754 


MENTAL RETARDATION, X-LINKED, WITH 
HYPOTONIA 


MRXA 


GDB.9954641 


MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, WITH APHASIA; MRXA 



10 



15 



20 



25 



30 



35 
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MSD 



GDB:1 19399 



SYNDROME 



MTM1 



MYCL2 



GDB: 119439 



GDB: 120209 



MYOTUBULAR MYOPATHY 1 ; MTM1 



MYCL-RELATBD PROCESSED GENE; 
MYCL2 



MYP1 



NDP 



GDB:127783 



GDB: 119449 



MYOPIA, X-LINKED; MYP1 



NORRIE DISEASE; NDP 



10 



NHS 



NPHL1 



GDB:120235 



| GDB:433705 



CATARACT-DENTAL SYNDROME 



NEPHROLITHIASIS, X-LINKED RECESSIVE, 
WITH RENAL FAILURE; XRN 



15 



NR0B1 



NSX 



GDB: 118982 



GDB: 125596 



ADRENAL HYPOPLASIA, CONGENITAL; 
AHC 



SYNDROME; NSX 



NYS1 



NYX 



GDB:1 19458 



GDB:1 19814 



NYSTAGMUS, X-LINKED; NYS 



NIGHTBLINDNESS, CONGENITAL 
STATIONARY, WITH MYOPIA; CSNBl 



OA1 



20 OASD 



GDB: 119459 



GDB:138457 



ALBINISM, OCULAR, TYPE1;QA1 



OCULAR, WITH LATE-ONSET 
SENSORINEURAL DEAFNESS; OASD 



OCRL 



GDB: 119461 



LOWE OCULOCEREBRORENAL 
SYNDROME; OCRL 



ODT1 



GDB: 125360 



TEETH, ABSENCE OF 



25 OFD1 



OPA2 



OPD1 



GDB: 120248 



GDB:125358 



GDB: 120249 



OROFACIODIGITAL SYNDROME 1; OFD1 
OPTIC ATROPHY 2; OPA2 



OTOPALATODIGITAL SYNDROME 



OPEM 



GDB: 119467 



OPHTHALMOPLEGIA, EXTERNAL, AND 
MYOPIA; OPEM 



30 



OPN1LW GDB: 120724 



OPN1MW GDB: 120622 



COLORBLINDNESS, PARTIAL, PROTAN 
SERIES; CBP 



COLORBLINDNESS, PARTIAL, DEUTAN 
SERIES; CBD; DCB 



35 



OTC 



GDB: 119468 



ORNITHINE TRANSCARB AMYLASE 
DEFICIENCY, HYPERAMMONEMIA DUE 
TO; OTC 
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P3 


GDB:9954667 


PROTEIN P3 


5 


PDHA1 


GDB: 11 8895 


PYRUVATE DEHYDROGENASE COMPLEX, 
El -ALPHA POLYPEPTIDE- 1 ; PDHA 1 




PDR 


GDB:203409 


AMYLOIDOSIS, FAMILIAL CUTANEOUS 




PFC 


GDB: 120275 


PROPERDIN DEFICIENCY, X-LINKED 




PFKFB1 


GDB.-125375 


6-@PHOSPHOFRUCTO-2-KINASE; PFKFB1 


10 


PGK1 


GDB:120282 


PHOSPHOGLYCERATE KINASE 1; PGK1 




rdK-lrl 


CjUo.1zU2o3 






PGS 




ri a xm v xx/ a t vtjd \a a t ttot? tva a tttoxt ah/ttu 
UAINU i - W AJjivbK JVL/VLr UxClvlA. 1 JLUIN W 1 1 rl 

MENTAL RETARDATION, BASAL 
GANGLIA DISEASE, 


15 


PHEX 


GDB: 120520 


HYPOPHOSPHATEMIA, VITAMIN 
D-RESISTANT RICKETS; HYP 




PHKA1 


GDB:120285 


PHOSPHORYLASE KINASE, ALPHA 1 
SUBUNIT (MUSCLE); PHKA1 




PHKA2 


GDB:127279 


GLYCOGEN STORAGE DISEASE VIII 


20 


PHP 


GDB:1 19494 


PANHYPOPITUITARISM; PHP 




PIGA 


GDB:138138 


PHOSPHATIDYLINOSITOL GLYCAN, 
CLASS A; PIGA 




PLP1 


GDB:120302 


PROTEOLIPID PROTEIN, MYELIN; PLP 




POF1 


GDB:120716 


PREMATURE OVARIAN FAILURE 1; POF1 


25 


POLA 


GDB:120304 


POLYMERASE, DNA, ALPHA; POLA 




rUU3r4 


vjDx3:35l3oo 


FIXATION 


30 


ppuv 




RFTARnATTOTvJ WTTT-T P^YPWOSTS 
IxJD 1 t\S\Ur\ 1 l\-/rN wiin. I V/nUOlO, 

PYRAMIDAL SIGNS, AND 
MACROORCHIDISM 


PRD 


GDB.-371323 


DYSPLASIA, PRIMARY 




PRPS1 


GDB:120318 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-I; PRPS1 


35 


PRPS2 


GDB:120320 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-H; PRPS2 
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PRS 


GDB: 128368 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-2, WITH DYSMORPHISM 
AND CEREBRAL 


5 


PRTS 


GDB: 128367 


PARTINGTON X-LINKED MENTAL 
RETARDATION SYNDROME; PRTS 




PSF2 


GDB: 119519 


TRANSPORTER 2, ABC; TAP2 




RENBP 


GDB:133792 


RENIN-BINDING PROTEIN; RENBP 


10 


RENS1 


GDB:9806348 


MENTAL RETARDATION, X-LINKED, 
RENPENNING TYPE 




RP2 


GDB: 120353 


RETINITIS PIGMENTOSA-2; RP2 




RP6 


GDB: 125381 


PIGMENTOSA-6; RP6 




RPGR 


GDB: 118736 


RETINITIS PIGMENTOSA-3; RP3 


15 


RPS4X 


GDB: 128 115 


RIBOSOMAL PROTEIN S4, X-LINKED; 
RPS4X 




RPS6KA3 


GDB:365648 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POLYPEPTIDE 3; RPS6KA3 




RSI 


GDB:1 19581 


RETINOSCfflSIS; RS 


zu 


Sll 


GDB:120361 


ANTIGEN, X-LINKED, SECOND; SAX2 




SDYS 


GDB: 119590 


GLYPICAN-3; GPC3 SIMPSON 
DYSMORPHIA SYNDROME; SDYS 




SEDL 


GDB: 120372 


SPONDYLOEPIPHYSEAL DYSPLASIA, 
LATE; SEDL 


oc 
ZJ 


SERPINA7 


GDB:120399 


THYROXINE-BINDING GLOBULIN OF 
SERUM; TBG 




SH2D1A 


GDB: 120701 


IMMUNODEFICIENCY, X-LINKED 
PROGRESSIVE COMBINED VARIABLE 


30 


SHFM2 


GDB:226635 


SPLIT-HAND/SPLIT-FOOT ANOMALY, 
X-LINKED 




SHOX 


GDB:61 18451 


SHORT STATURE; SS 




SLC25A5 


GDB:125190 


ADENINE NUCLEOTIDE TRANSLOCATOR 
2;ANT2 


35 


SMAX2 


GDB:9954643 


SPINAL MUSCULAR ATROPHY, X-LINKED 
LETHAL INFANTILE 
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SRPX 


GDB:38 11398 


RETINITIS PIGMENTOSA-3; RP3 


5 


SRS 


GDB: 136337 


MENTAL RETARDATION, X-LINKED, 
SNYDER-ROBINSON TYPE 




STS 


GDB: 120393 


ICHTHYOSIS, X-LINKED 




SYN1 


GDB:1 19606 


SYNAPSIN I; SYN1 




SYP 


GDB:125295 


SYNAPTOPHYSIN; SYP 


10 


TAF1 


GDB: 120573 


TATA BOX BINDING PROTEIN 

/Tom AOOnPTATRn PAPTOl? 9A* TAP9A 
^ 1 Drj-AooULlA 1 lllj r /\^ 1 \Jm\. At\ 9 i /-vr **t\. 




TAZ 


GDB: 120609 


CARDIOMYOPATHY, DILATED 3 A; CMD3A 
ENDOCARDIAL FIBROELASTOSIS-2; EFE2 




TBX22 


GDB: 10796448 


CLEFT PALATE, X-LINKED; CPX 


15 


TDD 


GDB: 119610 


MALE PSEUDOHERMAPHRODITISM: 
DEFICIENCY OF TESTICULAR 

1*7 on TVCGA/f fVT A QT7* 




TFE3 


GDB:125870 


IMMUNOGLOBULIN HEAVY-CHAIN 
ENHANCER-3; TFE3 


20 


THAS 


GDB: 1281 58 


THORACOABDOMINAL SYNDROME; TAS 




THC 


GDB: 125361 


THROMBOCYTOPENIA, X-LINKED; THC; 

VT T 

XL. 1 l 




TIMM8A 


GDB:1 19090 


DEAFNESS 1, PROGRESSIVE; DFN1 


25 


TIMP1 


GDB:1 19615 


TISSUE INHIBITOR OF 

x vTcnr ATT r^o'D r\T"DTKT A QT7 1 • XT A /TP 1 


TKCR 


GDB:1 19616 


TORTICOLLIS, KELOIDS, 

CRYr 1 UK-CrilJJloM, AJN1J KJiiNAi-. 

DYSPLASIA; TKC 




TNFSF5 


GDB:120632 


IMMUNODEFICIENCY WITH INCREASED 
IgM 


30 


UBE1 


GDB:1 18954 


UBIQUITIN-ACTIVATING ENZYME 1; UBE1 




UBE2A 


GDB:131647 


UBIQUITIN-CONJUGATING ENZYME E2A; j 
UBE2A 




WAS 


GDB: 120736 


WISKOTT-ALDRICH SYNDROME; WAS 


35 


WSN 


GDB:125864 


PARKINSONISM, EARLY-ONSET, WITH ! 
MENTAL RETARDATION 
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WTS 


GDB: 128373 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-6, WITH GYNECOMASTIA 
AND OBESITY; 


WWS 


GDB: 120497 


WIEACKER SYNDROME 


XIC 


GDB: 120498 


X-INACTTVATION-SPECIFIC TRANSCRIPT; 
XIST 


XIST 


GDB: 126428 


X-INACTIVATION-SPECIFIC TRANSCRIPT; 
XIST 


XK 


GDB: 120499 


Xk LOCUS 


XM 


GDB: 119634 


XM SYSTEM j 


XS 


GDB: 11 9636 


LUTHERAN SUPPRESSOR, X-LINKED; XS; 
LUXS 


ZFX 


GDB: 120502 


ZINC FINGER PROTEIN, X-LINKED; ZFX 


ZIC3 


GDB-.249141 


HETEROTAXY, X-LINKED VISCERAL; 
HTX1 


ZNF261 


GDB:9785766 


MENTAL RETARDATION, X-LINKED; 
DXS6673E 


ZNF41 


GDB: 125865 


ZINC FINGER PROTEIN-41 ; ZNF41 


ZNF6 


GDB: 120508 


ZINC FINGER PROTEIN-6; ZNF6 



Table 25: Genes, Locations and Genetic Disorders on Chromosome Y 



25 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


AMELY 


GDB: 119676 


AMELOGENIN, Y-CHROMOSOMAL; AMELY 


ASSP6 • 


GDB: 1 19020 


CITRULLINEMIA 


AZF1 


GDB:1 19027 


AZOOSPERMIA FACTOR 1; AZF1 


AZF2 


GDB:456131 


AZOOSPERMIA FACTOR 2; AZF2 


DAZ 


GDB:635890 


DELETED IN AZOOSPERMIA; DAZ 


GCY 


GDB: 119267 


CONTROL, Y-CHROMOSOME INFLUENCED; 
GCY 


RPS4Y 


GDB: 128052 


RIBOSOMAL PROTEIN S4, Y-LINKED; RPS4Y 


SMCY 


GDB:5875390 


HISTOCOMPATIBILITY Y ANTIGEN; HY; 
HYA 


SRY 


GDB: 125556 


SEX-DETERMINING REGION Y; SRY 
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ZFY 


GDB:120503 


ZINC FINGER PROTEIN, Y-LINKED; ZFY 


Table 26: 


Genes, Locations and Genetic Disorders in Unknown or Multiple 
Locations 



Gene 



ABAT 



AFD1 



BCH 



CCAT 



25 



|CECR9 
IcEPA 



30 CLA1 
|CLA3 



GDB Accession ID IOMIM Link 



GDB:581658 



IGAMMA-AMINOBUTYRATE TRANSAMINASE 



GDB: 128360 
GDB:265277 



Iacrodermatitis ENTEROPATHICA, 
ZI NC-DEFICIENCY TYPE; AEZ 

iFTT TFORME ADNATUM AND CLEFT PALATE 



lGDB-265292 DYSOSTOSIS, TREACHER COLLINS TYPE, 
|UL»b./o= WITHLIMB ANOMALIES 



lAGSl |GDB:10795417 
|GDB:6837715 



ENCEPHALOPATHY, FAMILIAL INF^ANTTLE, 
1 WITH CA LCIFICATION OF BASAL GANGLIA 

lFARBER LIPOGRANULOMATOSIS 



|gDB:6276Q19 
IgDB:136259 



GDB:1 18758 



AT RIAL SEPTAL DEFECT; ASP 

CETYLSEROTONIN METHYLTRANSFERASE; 
ASMT ACETYLSEROTONIN 
METHYLTRANSFERASE, Y-CHROMOSOMAL; 
ASMTY; HIOMTY 



GDB: 11 873 8 



{C HOREA, HEREDITARY BENIGN; BCH 
ICATARACT, CONGENITAL OR JUVENILE 



|GDB:1079616 3 
IgDB:581848 



^ AT EYE SYNDROME; CES 
CONTROL, CONGENITAL FAILURE OF 



CHED2 GDB.9957389 



CORNEAL DYSTROPHY, CONGENITAL 
HEREDITARY 



| GDB:1 19781 
|GDB: 128453 



GDB: 125229 
IgDB:119070 



CEREBE LLOPARENCHYMAL DISORDER ffl 
CEREBELLOPARENCHYMAL DISORDER I; CPD 1 1 



CE ROID-LIPOFUSCIN OSIS, NE URONAL 4; CLN4 
COPROPORPHYRIA 



35 
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CSF2RA GDB:1 18777 



COLONY STIMULATING FACTOR 2 RECEPTOR, 

ALPHA; CSF2RA 
GRANULOCYTE-MACROPHAGE 
COLONY-STIMULATING FACTOR RECEPTOR, 
ALPHA SUBUNIT, 



CTS1 



GDB: 11 8779 



CARPAL TUNNEL SYNDROME; CTS; CTS1 



DF 



GDB: 132645 



FACTOR D 



DIH1 



GDB:439243 



DIAPHRAGMATIC 



10 



)WS 



GDB:128371 



SYNDROME; DWS 



IDYT2 



lGDB-1 18788 DYSTONIA MUSCULORUM DEFORMANS 2; 

PYT2 



DYT4 



lGDB-43375 1 DYSTONIA MUSCULORUM DEFORMANS 4; 

IDYT4 



15 



IEBR3 IGDB- 1 1 8739 EPIDERMOLYSIS BULLOSA DYSTROPHICA 

iUROTROPHICA 



ECT 



GDB: 128640 



CENTRALOPATHIC EPILEPSY 



IeEF1A1L141gDB:1327185 IPROSTATIC CARCINOMA ONCOGENE PTI-1 



20 IEYCL2 |GDB:4642815 |eYE COLOR-3; EYCL3 



FA1 



|GDB- 1 1 8795 FANCONI ANEMIA, COMPLEMENTATION 

GROUP A; FACA 



IpANCB GDB:9864269 IfANCONI PANCYTOPENIA, TYPE 2 



GCSH 



GDB: 126842 



25 



HYPERGLYCINEMIA, ISOLATED NONKETOTIC, 
TYPE III; NKH3 



GCSL 



GDB: 132 139 



ISOLATED NONKETOTIC, TYPE IV; NKH4 



IGDF5 |GDB:433948 CARTILAGE-DERIVED MORPHOGENETIC 

PROTEIN 1 



GIP 



GDB: 11 9985 



GASTRIC INHIBITORY POLYPEPTIDE; GIP 



30 



GTS 



GDB: 118807 



GILLES DE LA TOURETTE SYNDROME; GTS 



HHG " IgDB:118740 IhYPERGONADOTROPIC HYPOGONADISM; HHG 



Ihmi 



GDB:265275 



OF ITO; HMI 



HOAC 



GDB: 118812 



35 



DEAFNESS, CONGENITAL, AUTOSOMAL 
RECESSIVE 
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HOKPP2 


GDB:595535 


HYPOKALEMIC PERIODIC PARALYSIS, TYPE II; 
HOKPP2 


5 


HRPT1 


GDB:125252 


HYPERPARATHYROIDISM, FAMILIAL PRIMARY 




HSD3B3 


GDB:676973 


GIANT CELL HEPATITIS NEONATAL 




HTC1 


GDB:265286 


HYPERTRICHOSIS UNIVERSALIS CONGENITA, 
AMBRAS TYPE; HTC1 


10 


HV1S 


GDB.-9955009 


HERPES VIRUS SENSITIVITY; HV1S 


ICR1 


GDB: 127785 


LAMELLAR, AUTOSOMAL DOMINANT FORM 




ICR5 


GDB: 127789 


ICHTHYOSIS CONGENITA, HARLEQUIN FETUS 
TYPE 


15 


IL3RA 


GDB: 128985 


INTERLEUKIN-3 RECEPTOR, ALPHA; IL3RA 
INTERLEUKIN-3 RECEPTOR, 
Y-CHROMOSOMAL; IL3RA 




KAL2 


GDB:265288 


KALLMANN SYNDROME 2- KAL2 




KMS 


GDB: 11 8827 


SYNDROME; KMS 




KRT18 


GDB: 120127 


KERATIN 18; KRT1 8 


20 


KSS 


GDB:9957718 


KEARNS-SAYRE SYNDROME; KSS 




LCAT 


GDB: 119359 


FISH-EYE DISEASE; FED 

LECITHIN:CHOLESTEROL ACYLTRANSFERASE 
DEFICIENCY 




LIMM 


GDB:9958161 


MYOPATHY, MITOCHONDRIAL, LETHAL 
INFANTILE; LIMM 


25 


MANBB 


GDB: 125262 


MANNOSIDOSIS, BETA; MANB1 




MCPH2 


GDB:9863035 ' 


MICROCEPHALY- MCT 




MEB 


GDB: 599557 


DISEASE 




MELAS 


GDB:9955855 


MELAS SYNDROME 


30 


MIC2 j 


GDB:120184 


SURFACE ANTIGEN MIC2; MIC2; CD99 MIC2 
SURFACE ANTIGEN, Y-CHROMOSOMAL; MIC2Y 




MPFD 


GDB:439372 


CONGENITAL, WITH FIBER-TYPE 
DISPROPORTION 




MS 


GDB:229116 


SCLEROSIS; MS 


35 


MSS 


GDB: 118743 


MARINESCO-SJOGREN SYNDROME; MSS 
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MTATP6 


GDB: 118897 


ATP SYNTHASE 6; MTATP6 




MTCOl 


GDB: 118900 


COMPLEX IV, CYTOCHROME c OXIDASE 

QT TT>T TXTTT T- \ATT*f\1 - i~*f~\T 

oUoUlNll l; MILU1; CUl 








ti 1 Ul^iiXUiVLo C OAUJAoli 111; MTC03 




MTCYR 

IVA 1 X i-J 




f^fW/TDT PY TTT PVTHPUD /""MV /ft? 1* OT TOT TXTTT 1 

CUJYlrLrliA 111, ti lUUHKOJVLb b oUBUNll 




MTND1 


GDB: 118911 


COMPLEX I, SUBUNIT ND1 ; MTND1 


10 


MTND2 


GDB: 11 8912 


COMPLEX I, SUBUNIT ND2; MTND2 




MTND4 


GDB: 118914 
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The present invention is not to be limited in scope by the specific 



embodiments described herein. Indeed, various modifications of the invention in addition 
to those described will become apparent to those skilled in the art from the foregoing 
description and accompanying figures. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications are cited herein, the disclosures of which are 
incorporated by reference in their entireties. 
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The invention can be illustrated by the following embodiments enumerated 
in the numbered paragraphs that follow: 

1 . A method for identifying a test compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, comprising the steps of (a) 
contacting a detectably labeled target RNA molecule with a library of solid support-attached 
test compounds under conditions that permit direct binding of the labeled target RNA to a 
member of the library of solid support-attached test compounds so that a detectably labeled 
target RNA: support-attached test compound complex is formed; (b) separating the 
detectably labeled target RNA:support-attached test compound complex formed in step (a) 
from uncomplexed target RNA molecules and test compounds, and (c) determining a 
structure of the test compound of the RNA:support-attached test compound complex. 

2. The method of paragraph 1 in which the target RNA molecule 
contains regions of 28S rRNA or analogs thereof. 



3. The method of paragraph 1 in which the detectably labeled RNA is 
labeled with a fluorescent dye, phosphorescent dye, ultraviolet dye, infrared dye, visible 
dye, radiolabel, enzyme, spectroscopic colorimetric label, affinity tag, or nanoparticle. 

4. The method of paragraph 1 in which the test compound is selected 
from a combinatorial library comprising peptoids; random bio-oligomers; diversomers such 
as hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; nonpeptidal 

25 peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid libraries; 
antibody libraries; carbohydrate libraries; and small organic molecule libraries including, 
but not limited to, benzodiazepines, isoprenoids, thiazolidinones, metathiazanones, 
pyrrolidines, morpholino compounds, or diazepindiones. 



30 



5. The method of paragraph 1 in which screening a library of test 
compounds preferably comprises contacting the test compound with the target nucleic acid 
in the presence of an aqueous solution, the aqueous solution comprising a buffer and a 
combination of salts, preferably approximating or mimicking physiologic conditions 



35 
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6. The method of paragraph 5 in which the aqueous solution optionally 
further comprises non-specific nucleic acids comprising DNA, yeast tRNA, salmon sperm 
DNA, homoribopoiymers, and nonspecific RNA. 

7. The method of paragraph 5 in which the aqueous solution further 
comprises a buffer, a combination of salts, and optionally, a detergent or a surfactant. In 
another embodiment, the aqueous solution further comprises a combination of salts, from 
about 0 mM to about 100 mM KC1, from about 0 mM to about 1 M NaCl, and from about 0 

1Q mM to about 200 mM MgCl 2 . In a preferred embodiment, the combination of salts is about 
100 mM KC1, 500 mM NaCI, and 10 mM MgCl 2 . In another embodiment, the solution 
optionally comprises from about 0.01% to about 0.5% (w/v) of a detergent or a surfactant. 

8. Any method that detects an altered physical property of a target 
15 nucleic acid compiexed to a test compound attached to a solid support from the unbound 

target nucleic acid may be used for separation of the compiexed and non-complexed target 
nucleic acids in the method of paragraph 1 . Methods such as flow cytometry, affinity 
chromatography, manual batch mode separation, suspension of beads in electric fields, and 
microwave are used for the separation of the compiexed and non-complexed target nucleic 

20 acids ' 

9. The structure of the substantially one type of test compound of the 
RN Attest compound complex of paragraph 1 is determined, in part, by the type of library of 
test compounds. In a preferred embodiment wherein the combinatorial libraries are small 

25 organic molecule libraries, mass spectroscopy, NMR, or vibration spectroscopy are used to 
determine the structure of the test compounds. In an embodiment wherein the 
combinatorial libraries are peptide or peptide-based libraries, Edman degradation is used to 
determine the structure of the test compounds. 
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WHAT IS CLAIMED IS: 

1 . A method for identifying a test compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, comprising the steps of: 

(a) contacting a detectably labeled target RNA molecule with a 
library of solid support-attached test compounds under 
conditions that permit direct binding of the labeled target 
RNA to a member of the library of solid support-attached test 
compounds so that a detectably labeled target RNA:support- 
attached test compound complex is formed; 

(b) separating the detectably labeled target RNA:support-attached 
test compound complex formed in step (a) from uncomplexed 
target RNA molecules and test compounds by flow cytometry; 
and 

(c) determining a structure of the substantially one type of test 
compound of the RNA.support-attached test compound 
complex by mass spectroscopy. 
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ABSTRACT 



The present invention relates to a method for screening and identifying test 



compounds that modulate premature translation termination and/or nonsense-mediated 
messenger ribonucleic acid ("mRNA") by interacting with a preselected target ribonucleic 
acid ("RNA"). In particular, the present invention relates to identifying test compounds that 
bind to regions of the 28S ribosomal RNA ("rRNA") and analogs thereof. Direct, non- 
competitive binding assays are advantageously used to screen bead-based libraries of 
compounds for those that selectively bind to a preselected target RNA. Binding of target 
RNA molecules to a particular test compound is detected using any physical method that 
measures the altered physical property of the target RNA bound to a test compound. The 
structure of the test compound attached to the labeled RNA is also determined. The 
methods used will depend, in part, on the nature of the library screened. The methods of the 
present invention provide a simple, sensitive assay for high-throughput screening of 
libraries of compounds to identify pharmaceutical leads. 
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Figure 1 
Sheet 1/6 
Attorney Docket No. 10589-023 
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Figure 2 
Sheet 2/6 
Attorney Docket No. 10589-023 
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Figure 3 
Sheet 3/6 
Attorney Docket No. 10589-023 




7. DMSO 




9. Compound E (J) 

10. Compound F (J) 

11. Compound G (I) 

12. Compound H (J) 



NY2-132048I.1 



6 0390Ti2 *07tH^Qa 



Figure 4 
Sheet 4/6 
Attorney Docket No. 10589-023 
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Figure 5 
Sheet 5/6 
Attorney Docket No. 10589-023 
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METHODS FOR IDENTIFYING SMALL MOLECULES THAT 
MODULATE PREMATURE TRANSLATION TERMINATION 
AND NONSENSE MEDIATED mRNA DECAY 



1. INTRODUCTION 

The present invention relates to a method for screening and identifying test 
compounds that modulate premature translation termination and/or nonsense-mediated 

10 messenger ribonucleic acid ("mRNA") decay by interacting with a preselected target 
ribonucleic acid ("RNA"). In particular, the present invention relates to identifying test 
compounds that bind to regions of the 28S ribosomal RNA ("rRNA") and analogs thereof. 
Direct, non-competitive binding assays are advantageously used to screen libraries of 
compounds for those that selectively bind to a preselected target RNA. Binding of target 

1 5 RNA molecules to a particular test compound is detected using any physical method that 
measures the altered physical property of the target RNA bound to a test compound. The 
methods of the present invention provide a simple, sensitive assay for high-throughput 
screening of libraries of compounds to identify pharmaceutical leads. 

20 2. BACKGROUND OF THE INVENTION 

Protein synthesis encompasses the processes of translation initiation, 
elongation, and termination, each of which has evolved to occur with great accuracy and has 
the capacity to be a regulated step in the pathway of gene expression. Recent studies, 
including those suggesting that events at termination may regulate the ability of ribosomes 

25 to recycle to the start site of the same mRNA, have underscored the potential of termination 
to regulate other aspects of translation. The RNA triplets UAA, UAG, and UGA are non- 
coding and promote translational termination. Termination starts when one of the three 
termination codons enters the A site of the ribosome signaling the polypeptide chain release 
factors to bind and recognize the termination signal. Subsequently, the ester bond between 

30 the 3' nucleotide of the transfer RNA ("tRNA") located in the ribosome' s P site and the 
nascent polypeptide chain is hydrolyzed, the completed polypeptide chain is released, and 
the ribosome subunits are recycled for another round of translation. 

Nonsense-mediated mRNA decay is a surveillance mechanism that 
minimizes the translation and regulates the RNA stability of nonsense RNAs that contain 

35 chain termination mutations (see, e.g., Hentze & Kulozik, 1999, Cell 96:307-3 10; 

Culbertson, 1999, Trends in Genetics 15:74-80; Li & Wilkinson, 1998, Immunity 8:135- 
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141; and Ruiz-Echevarria etal, 1996, Trends in Biological Sciences, 21:433-438). Chain 
termination mutations are caused when a base substitution or frameshift mutation changes a 
codon into a termination codon, i.e., a premature stop codon that causes translational 

^ termination. In nonsense-mediated mRNA decay, mRNAs with premature stop codons are 
frequently subject to degradation. A truncated protein is produced as a result of the 
translation apparatus prematurely terminating at the stop codon. 

Certain classes of known antibiotics have been characterized and found to 
interact with RNA. For example, the antibiotic thiostrepton binds tightly to a 60-mer from 

1Q ribosomal RNA (Cundliffe etal, 1990, in The Ribosome: Structure, Function & Evolution 
(Schlessinger et al, eds.) American Society for Microbiology, Washington, D.C. pp. 479- 
490). Bacterial resistance to various antibiotics often involves methylation at specific rRNA 
sites (Cundliffe, 1989, Ann. Rev. Microbiol. 43:207-233). Aminoglycoside aminocyclitol 
(aminoglycoside) antibiotics and peptide antibiotics are known to inhibit group I intron 

15 splicing by binding to specific regions of the RNA (von Ahsen et al. ,1991, Nature 
(London) 353:368-370). Some of these same aminoglycosides have also been found to 
inhibit hammerhead ribozyme function (Stage et aL 9 1995, RNA 1 :95-101). In addition, 
certain aminoglycosides and other protein synthesis inhibitors have been found to interact 
with specific bases in 16S rRNA (Woodcock et al, 1991, EMBO J. 10:3099-3103). An 

2o oligonucleotide analog of the 16S rRNA has also been shown to interact with certain 
aminoglycosides (Purohit et al, 1994, Nature 370:659-662). A molecular basis for 
hypersensitivity to aminoglycosides has been found to be located in a single base change in 
mitochondrial rRNA (Hutchin etal, 1993, Nucleic Acids Res. 21:4174-4179). 
Aminoglycosides have also been shown to inhibit the interaction between specific structural 

25 RNA motifs and ^ corresponding RNA binding protein. Zapp et al (Cell, 1993, 74:969- 
978) has demonstrated that the aminoglycosides neomycin B, lividomycin A, and 
tobramycin can block the binding of Rev, a viral regulatory protein required for viral gene 
expression, to its viral recognition element in the IIB (or RRE) region of HIV RNA. This 
blockage appears to be the result of competitive binding of the antibiotics directly to the 

3Q RRE RNA structural motif. 

Aminoglycosides have also been found to promote nonsense suppression 
(see, e.g., Bedwell et al. 9 1997, Nat. Med. 3:1280-1284 and Howard etaU 1996, Nat. Med. 
2:467-469). Nonsense mutations cause approximately 10 to 30 percent of the individual 
cases of virtually all inherited diseases. Although nonsense mutations inhibit the synthesis 
35 of a full-length protein to one percent or less of wild-type levels, minimally boosting the 
expression levels of the full-length protein to between five and fifteen percent of normal 
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levels can eliminate or greatly reduce the severity of the disease. Clinical approaches that 
target the translation termination event to promote nonsense suppression have recently been 
described for model systems of cystic fibrosis and muscular dystrophy. Gentamicin is an 
aminoglycoside antibiotic that causes translational misreading and allowed the insertion of 
an amino acid at the site of the nonsense codon in models of cystic fibrosis, Hurlers 
Syndrome, and muscular dystrophy (see, e.g., Barton-Davis et aL, 1999, J. Clin. Invest. 
104:375-381). These results strongly suggest that drugs that promote nonsense suppression 
by altering translation termination efficiency of a premature termination codon can be 
therapeutically valuable in the treatment of diseases caused by nonsense mutations. 

Citation or identification of any reference in Section 2 of this application is 
not an admission that such reference is available as prior art to the present invention. 



3. SUMMARY OF THF, INVENTION 

j ^ The present invention relates to methods for identifying compounds that 

modulate translation termination and/or nonsense-mediated mKNA decay by identifying 
compounds that bind to preselected target elements of nucleic acids including, but not 
limited to, specific RNA sequences, RNA structural motifs, and/or RNA structural 
elements. In particular, the present invention relates to identifying test compounds that bind 

2Q to regions of the 28S rKNA and analogs thereof. The specific target RNA sequences, RNA 
structural motifs, and/or RNA structural elements regions of the 28S rRNA and 
analogs thereof) are used as targets for screening small molecules and identifying those that 
direcdy bind these specific sequences, motifs, and/or structural elements. For example, 
methods are described in which a preselected target RNA having a detectable label is used 

2 ^ to screen a library of test compounds, preferably under physiologic conditions. Any 

complexes formed between the target RNA and a member of the library are identified using 
physical methods that detect the altered physical property of the target RNA bound to a test 
compound. In particular, the present invention relates to methods for using a target RNA 
having a detectable label to screen a library of test compounds free in solution, in labeled 

30 tubes or microtiter plate, or in a microarray. Compounds in the library that bind to the 
labeled target RNA will form a detectably labeled complex. The detectably labeled 
complex can then be identified and removed from the uncomplexed, unlabeled test 
compounds in the library, and from uncomplexed, labeled target RNA, by a variety of 
methods, including but not limited to, methods that differentiate changes in the 

35 electrophoretic, chromatographic, or thermostable properties of the complexed target RNA. 
Such methods include, but are not limited to, electrophoresis, fluorescence spectroscopy, 
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surface plasmon resonance, mass spectrometry, scintillation proximity assay, structure- 
activity relationships ("SAR") by NMR spectroscopy, size exclusion chromatography, 
affinity chromatography, and nanoparticle aggregation. The structure of the test compound 

^ attached to the labeled RNA is then determined. The methods used will depend, in part, on 
the nature of the library screened. For example, assays or microarrays of test compounds, 
each having an address or identifier, may be deconvoluted, e.g., by cross-referencing the 
positive sample to original compound list that was applied to the individual test assays. 
Another method for identifying test compounds includes de novo structure determination of 

^ the test compounds using mass spectrometry or nuclear magnetic resonance ("NMR"). The 
test compounds identified are useful for any purpose to which a binding reaction may be 
put, for example in assay methods, diagnostic procedures, cell sorting, as inhibitors of target 
molecule function, as probes, as sequestering agents and the like. In addition, small organic 
molecules which interact specifically with target RNA molecules may be useful as lead 

j ^ compounds for the development of therapeutic agents. 

The methods described herein for the identification of compounds that 
directly bind to 28S rRNA are well suited for high-throughput screening. The direct 
binding method of the invention offers advantages over drug screening systems for 
competitors that inhibit the formation of naturally-occurring RNA binding proteinrtarget 

2Q RNA complexes; competitive assays. The direct binding method of the invention is 
rapid and can be set up to be readily performed, e.g., by a technician, making it amenable to 
high-throughput screening. The method of the invention also eliminates the bias inherent in 
the competitive drug screening systems, which require the use of a preselected host cell 
factor that may not have physiological relevance to the activity of the target RNA. Instead, 

2j the methods of the invention are used to identify any compound that can directly bind to 
28S rRNA preferably under physiologic conditions. As a result, the compounds so 
identified can inhibit the interaction of the target RNA with any one or more of the native 
host cell factors (whether known or unknown) required for activity of the RNA in vivo. 

The present invention may be understood more fully by reference to the 
detailed description and examples, which are intended to illustrate non-limiting 
embodiments of the invention. 

3.1. Definitions 

As used herein, a "target nucleic acid" refers to RNA, DNA, or a chemically 
2 5 modified variant thereof. Tn a preferred embodiment, the target nucleic acid is RNA. A 
target nucleic acid also refers to tertiary structures of the nucleic acids, such as, but not 
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limited to loops, bulges, pseudoknots, guanosine quartets and turns. A target nucleic acid 
also refers to RNA elements such as, but not limited to, 28S rRNA and structural analogs 
thereof, which are described in Section 5.1 . Non-limiting examples of target nucleic acids 

5 are presented in Section 5. 1 

As used herein, a "library" refers to a plurality of test compounds with which 
a target nucleic acid molecule is contacted. A library can be a combinatorial library, e.g., a 
collection of test compounds synthesized using combinatorial chemistry techniques, or a 
collection of unique chemicals of low molecular weight (less than 1000 daltons) that each 
j q occupy a unique three-dimensional space. 

As used herein, a "label" or "detectable label" is a composition that is 
detectable, either directly or indirectly, by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. For example, useful labels include radioactive 
isotopes (e.g., 32 P, 35 S, and 3 H), dyes, fluorescent dyes, electron-dense reagents, enzymes 

j 5 and their substrates (e.g. , as commonly used in enzyme-linked immunoassays, e.g. , alkaline 
phosphatase and horse radish peroxidase), biotin-streptavidin, digoxigenin, or haptens and 
proteins for which antisera or monoclonal antibodies are available. Moreover, a label or 
detectable moiety can include a "affinity tag" that, when coupled with the target nucleic acid 
and incubated with a test compound or compound library, allows for the affinity capture of 

2Q the target nucleic acid along with molecules bound to the target nucleic acid. One skilled in 
the art will appreciate that a affinity tag bound to the target nucleic acids has, by definition, 
a complimentary ligand coupled to a solid support that allows for its capture. For example, 
useful affinity tags and complimentary partners include, but are not limited to, 
biotin-streptavidin, complimentary nucleic acid fragments (e.g., oligo dT-oligo dA, oligo 

25 T-oiigo A, oligo dG-oligo dC, oligo G-oligo C), aptamers, or haptens and proteins for which 
antisera or monoclonal antibodies are available. The label or detectable moiety is typically 
bound, either covalently, through a linker or chemical bound, or through ionic, van der 
Waals or hydrogen bonds to the molecule to be detected. 

As used herein, a "dye" refers to a molecule that, when exposed to radiation, 

3Q emits radiation at a level that is detectable visually or via conventional spectroscopic means. 
As used herein, a "visible dye" refers to a molecule having a chromophore that absorbs 
radiation in the visible region of the spectrum (i.e., having a wavelength of between about 
400 nm and about 700 nm) such that the transmitted radiation is in the visible region and 
can be detected either visually or by conventional spectroscopic means. As used herein, an 

35 "ultraviolet dye" refers to a molecule having a chromophore that absorbs radiation in the 
ultraviolet region of the spectrum (/.<?., having a wavelength of between about 30 nm and 
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about 400 nm). As used herein, an "infrared dye" refers to a molecule having a 
chromophore that absorbs radiation in the infrared region of the spectrum (f.e, having a 
wavelength between about 700 nm and about 3,000 nm). A "chromophore" is the network 
of atoms of the dye that, when exposed to radiation, emits radiation at a level that is 
detectable visually or via conventional spectroscopic means. One of skill in the art will 
readily appreciate that although a dye absorbs radiation in one region of the spectrum, it 
may emit radiation in another region of the spectrum. For example, an ultraviolet dye may 
emit radiation in the visible region of the spectrum. One of skill in the art will also readily 
appreciate that a dye can transmit radiation or can emit radiation via fluorescence or 
phosphorescence. 



is not limited to salts of acidic or basic groups that may be present in test compounds 
identified using the methods of the present invention. Test compounds that are basic in 
nature are capable of forming a wide variety of salts with various inorganic and organic 
acids. The acids that can be used to prepare pharmaceutical^ acceptable acid addition salts 
of such basic compounds are those that form non-toxic acid addition salts, i.e.* salts 
containing pharmacologically acceptable anions, including but not limited to sulfuric, citric, 
maleic, acetic, oxalic, hydrochloride, hydrobromide, hydroiodide, nitrate, sulfate, bisulfate, 
phosphate, acid phosphate, isonicotinate, acetate, lactate, salicylate, citrate, acid citrate, 
tartrate, oleate, tannate, pantothenate, bitartrate, ascorbate, succinate, maleate, gentisinate, 
fiimarate, gluconate, glucaronate, saccharate, formate, benzoate, glutamate, 
methanesulfonate, ethanesulfonate, benzenesulfonate, p-toluenesulfonate and pamoate (Le. 9 
l,l f -methylene-bis-(2-hydroxy-3-naphthoate)) salts. Test compounds that include an amino 
moiety may form pharmaceutically or cosmetically acceptable salts with various amino 
acids, in addition to the acids mentioned above. Test compounds that are acidic in nature 
are capable of forming base salts with various pharmacologically or cosmetically acceptable 
cations. Examples of such salts include alkali metal or alkaline earth metal salts and, 
particularly, calcium, magnesium, sodium lithium, zinc, potassium, and iron salts. 



the assay can be performed in such a fashion that at some point, only one compound need be 
used in each reaction so that, if the result is indicative of a binding event occurring between 
the target RNA molecule and the test compound, the test compound can be easily identified. 



The phrase "pharmaceutically acceptable salt(s)," as used herein includes but 



30 



By "substantially one type of test compound," as used herein, is meant that 



35 



FIG. 1. 



4. DESCRIPTION OF DRAWINGS 

The human 28S rRNA. Domains II and V are circled. 
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Gel retardation analysis to detect peptide-RNA interactions. In 20 yl 
reactions containing increasing concentrations of Tat 47 . 58 peptide (0. 1 jxM, 
0.2 fiM, 0.4 ^M, 0.8 |nM, 1.6 jiM) 50 pmole TAR RNA oligonucleotide was 
added in TK buffer. The reaction mixture was then heated at 90 °C for 2 min 
and allowed to cool slowly to 24 °C. 10 ml of 30% glycerol was added to 
each sample and applied to a 12% non-denaturing polyacrylamide gel. The 
gel was electrophoresed using 1200 volt-hours at 4°C in TBE Buffer. 
Following electrophoresis, the gel was dried and the radioactivity was 
quantitated with a phosphorimager. The concentration of peptide added is 
indicated above each lane. 

Gentamicin interacts with an oligonucleotide corresponding to the 1 6S 
rRNA. 20 |xl reactions containing increasing concentrations of gentamicin (1 ' 
ng/ml, 10 ng/ml, 100 ng/ml, 1 ng/ml, 10 jag/ml, 50 \xg/ml, 500 jig/ml) were 
added to 50 pmole RNA oligonucleotide in TKM buffer, heated at 90 °C for 
2 min and allowed to cool slowly to 24 °C. Then 1 0 \il of 30% glycerol was 
added to each sample and the samples were applied to a 13.5% non- 
denaturing polyacrylamide gel. The gel was electrophoresed using 1200 
volt-hours at 4°C in TBE Buffer. Following electrophoresis, the gel was 
dried and the radioactivity was quantitated using a phosphorimager. The 
concentration of gentamicin added is indicated above each lane. 
The presence of 10 pg/ml gentamicin produces a gel mobility shift in the 
presence of the 16S rRNA oligonucleotide. 20 fil reactions containing 
increasing concentrations of gentamicin (100 ng/ml, 10 ng/ml, 1 ng/ml, 100 
pg/ml, and 10 pg/ml) were added to 50 pmole RNA oligonucleotide in TKM 
buffer were treated as described for Figure 3. 

Gentamicin binding to the 16S rRNA oligonucleotide is weak in the absence 
of MgCl 2 . Reaction mixtures containing gentamicin (1 mg/ml, 100 ng/ml, 
10 ng/ml,l ng/ml, 0.1 ng/ml, and 10 ng/ml) were treated as described in 
Figure 3 except that the TKM buffer does not contain MgCl 2 . 
Gel retardation analysis to detect peptide-RNA interactions. In reactions 
containing increasing concentrations of Tat 47 . 58 peptide (0.1 jiM, 0.2 jxM, 0.4 
jxM, 0.8 |nM, 1 .6 |iM) 50 pmole TAR RNA oligonucleotide was added in TK 
buffer. The reaction mixture was then heated at 90 °C for 2 min and allowed 
to cool slowly to 24°C. The reactions were loaded onto a SCE9610 
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automated capillary electrophoresis apparatus (SpectruMedix; State College, 
Pennsylvania). The peaks correspond to the amount of free TAR RNA 
("TAR") or the Tat-TAR complex ("Tat-TAR"). The concentration of 
peptide added is indicated below each lane. 

Small molecules involved in nonsense suppression alter the chemical 
footprinting pattern in Domain V of the 28S rRNA. 1 00 pmol of ribosomes 
were incubated with 1 00 \xM compound, followed by treatment with the 
chemical modifying agents kethoxal (KE) and dimethyl sulfate (DMS, not 
shown). Following chemical modification, rRNA was prepared and analyzed 
in primer extension reactions using end-labeled oligonucleotides hybridizing 
to rRNA. A sequencing reaction was run in parallel as a marker. 
Small molecules involved in nonsense suppression alter the chemical 
footprinting pattern in Domain V of the 28S rRNA. 100 pmol of ribosomes 
were incubated with 1 00 jxM compound, followed by treatment with the 
chemical modifying agents kethoxal (KE) and dimethyl sulfate (DMS, not 
shown). Following chemical modification, rRNA was prepared and analyzed 
in primer extension reactions using end-labeled oligonucleotides hybridizing 
to rRNA. A sequencing reaction was run in parallel as a marker. 
Small molecules involved in nonsense suppression alter the chemical 
footprinting pattern in Domain II (GTPase Center) of the 28S rRNA. 100 
pmol of ribosomes were incubated with 100 ^iM compound, followed by 
treatment with the chemical modifying agents kethoxal (KE) and dimethyl 
sulfate (DMS, not shown). Following chemical modification, rRNA was 
prepared and analyzed in primer extension reactions using end-labeled 
oligonucleotides hybridizing to rRNA. A sequencing reaction was run in 
parallel as a marker. 

Small molecules involved in nonsense suppression alter the chemical 
footprinting pattern of domain II of the 28S rRNA. 1 00 pmol of ribosomes 
were incubated with 100 ^iM compound, followed by treatment with 
chemical modifying agents dimethyl sulfate (DMS) and kethoxal (KE). 
Following chemical modification, rRNA was prepared and analyzed in 
primer extension reactions using end-labeled oligonucleotides hybridizing to 
rRNA. A sequencing reaction was run in parallel as a marker. 
A specific region of Domain II can compete for compound binding and 
prevents nonsense suppression in vitro. The in vitro nonsense suppression 
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assay was performed using a luciferase construct with a UGA nonsense 
mutation. 0.1 jxM compound was present in the reaction to induce nonsense 
suppression. Competitor RNA corresponding to Domain II was added at the 
5 indicated concentrations (0, 1, 2.5, 5, 7.5, 10 pM) to titrate the small 

molecule and prevent nonsense suppression. 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods for identifying compounds that bind 

10 to preselected target elements of nucleic acids, in particular, RNAs, including but not 
limited to preselected target RNA sequencing structural motifs, or structural elements. In 
particular, the present invention relates to identifying test compounds that bind to regions of 
the 28S rRNA and analogs thereof. Methods are described in which a preselected target 
RNA having a detectable label is used to screen a library of test compounds. Any 

15 complexes formed between the target RNA and a member of the library are identified using 
physical methods that detect the altered physical property of the target RNA bound to a test 
compound. Changes in the physical property of the RNA-test compound complex relative 
to the target RNA or test compound can be measured by methods such as, but not limited to, 
methods that detect a change in mobility due to a change in mass, change in charge, or a 

2Q change in thermostability. Such methods include, but are not limited to, electrophoresis, 
fluorescence spectroscopy, surface plasmon resonance, mass spectrometry, scintillation, 
proximity assay, structure-activity relationships ("SAR") by NMR spectroscopy, size 
exclusion chromatography, affinity chromatography, and nanoparticle aggregation. In 
particular, the present invention relates to methods for using a target RNA having a 

25 detectable label to screen a library of test compounds free in solution, in labeled tubes or 
microliter plate, or in a microarray. Compounds in the library that bind to the labeled target 
RNA will form a detectably labeled complex. The detectably labeled complex can then be 
identified and removed from the unlabeled, uncomplexed test compounds in the library by a 
variety of methods capable of differentiating changes in the physical properties of the 

3Q compiexed target RNA. The structure of the test compound attached to the labeled RNA is 
also determined. The methods used will depend, in part, on the nature of the library 
screened. For example, assays or microarrays of test compounds, each having an address or 
identifier, may be deconvolved, e.g., by cross-referencing the positive sample to an original 
compound list that was applied to the individual test assays. Another method for identifying 
3 5 test compounds includes de novo structure determination of the test compounds using mass 
spectrometry or nuclear magnetic resonance ("NMR"). 
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Thus, the methods of the present invention provide a simple, sensitive assay 
for high-throughput screening of libraries of test compounds, in which the test compounds 
of the library that specifically bind a preselected target nucleic acid are easily distinguished 
from non-binding members of the library. The structures of the binding molecules are 
deciphered from the input library by methods depending on the type of library that is used. 
The test compounds so identified are useful for any purpose to which a binding reaction 
may be put, for example in assay methods, diagnostic procedures, cell sorting, as inhibitors 
of target molecule function, as probes, as sequestering agents and lead compounds for 
development of therapeutics, and the like. Small organic compounds that are identified to 
interact specifically with the target RNA molecules are particularly attractive candidates as 
lead compounds for the development of therapeutic agents. 

The assay of the invention reduces bias introduced by competitive binding 
assays which require the identification and use of a host cell factor (presumably essential for 
modulating RNA function) as a binding partner for the target RNA. The assays of the 
present invention are designed to detect any compound or agent that binds to 28S rRNA, 
preferably under physiologic conditions. Such agents can then be tested for biological 
activity, without establishing or guessing which host cell factor or factors is required for 
modulating the function and/or activity of 28S rRNA. 

Section 5.1 describes examples of 28S rRNA and analogs thereof that can be 
used as preselected target RNAs. Section 5.2 describes detectable labels for target nucleic 
acids that are useful in the methods of the invention. Section 5.3 describes libraries of test 
compounds. Section 5.4 provides conditions for binding a labeled target RNA to a test 
compound of a library and detecting RNA binding to a test compound using the methods of 
the invention. Section 5.5 provides methods for separating complexes of target RNAs 
bound to a test compound from an unbound RNA. Section 5.6 describes methods for 
identifying test compounds that are bound to the target RNA. Section 5.7 describes a 
secondary, biological screen of test compounds identified by the methods of the invention to 
test the effect of the test compounds in vivo. Section 5.8 describes the use of test 
compounds identified by the methods of the invention for treating or preventing a disease or 
abnormal condition in mammals. 

5.1. 28S rRNA and Analogs Thereof 

The ribosome is a 2.5-MDa ribonucleoprotein complex involved in the 
decoding of genetic material from mRNA to proteins. A combination of biophysical and 
biochemical analysis have provided three dimensional models of the ribosome as well as 
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detailed analyses into the mechanism of the individual steps in translation (see, e.g., Green 
& Noller, 1997, Annu. Rev. Biochem. 66:679-716; Cate etaL, 1999, Science 
285(5436):2095-2104; and Ban et al., 2000, Science.289(5481): 905-920.). 

The 28S rRNA is one of the ribosomal RNA components of the 60S subunit 
of eucaryotic ribosomes. The 28S rRNA sequences are conserved when expressed as 
mature rRNAs, although the 28S rRNA contains variable sequence tracts that are 
interspersed among conserved core sequences and lacking in the counterpart bacterial 23 S 
rRNA (see, e.g., Hancock & Dover, 1988, Mol. Biol. EvoL 5:377-391). A diagram of the 
28S rRNA is presented in Figure 1, with domains II and V circled. As indicated in Figure 1, 
a GTPase center has been mapped to domain II and the peptidyl transferase center has been 
mapped to domain V. 

Compounds that interact in these regions or modulate local changes within 
these domains of the ribosome (e.g., alter base pairing interactions, basfc modification or 
modulate binding of trans-acting factors that bind to these regions) have the potential to 
modulate translation terminati6n. These regions, i.e., domains II and V are conserved from 
prokaryotes to eukaryotes, but the role of these regions in modulating translation 
termination has not been realized in eukaryotes. In bacteria, when a short RNA fragment, 
complementary to the E. coli 23S rRNA segment comprising nucleotides 735 to 766 (in 
domain II), is expressed in vivo, suppression of UGA nonsense mutations, but not UAA or 
UAG, results (Chernyaeva et al, 1999, J Bacterid 181 :5257-5262). Other regions of the 
23 S rRNA in E. coli have been implicated in nonsense suppression including the GTPase 
center in domain II (nt 1034-1 120; Jemiolo et al, 1995, Proc. Nat. Acad. Sci. 
92:12309-12313). 

Genetic mutations in bacteria have also identified rRNA mutations that 
either increase the level of frameshifting in the trpE or the suppression of a nonsense 
mutations in the trpA gene (reviewed in Green & Noller, 1997, Annu. Rev. Biochem. 
66:679-716). The frameshifting mutations mapped to domains IV and V of the 23S rRNA. 
Disruption of the interaction of the CCA end of the tRNA with the peptidyl transferase 
center of the ribosome has been demonstrated to result in an increased translational error 
frequency (reviewed in Green & Noller, 1997, Annu. Rev. Biochem. 66:679-716). 

Regions of the 28S rRNA involved in frameshifting, nonsense mutation 
suppression, GTPase activity, or peptidyl transferase are attractive target RNAs to identify 
compounds that modulate premature translation termination and/or nonsense mediated 
decay. The interference of a test compound with one or more of these functions could 
potentially mediate translation termination by interfering with premature translation 
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termination. Without being bound by theory, a test compound could potentially mediate 
translation termination by causing readthrough of a premature translation codon, therefore 
allowing the synthesis of the full-length protein. 

In a preferred embodiment, the target RNA comprises a region of 28S rRNA 
corresponding to domain II (see, e.g., nucleotides 1310 to 2333 of accession number 
Ml 1 167) or domain V of 28S rRNA (see, e.g., nucleotides 3859 to 4425 of accession 
number Ml 1 167) or an analog thereof. It will become apparent to one of skill in the art that 
an analog of the 28S rRNA has an analogous structure and function to native 28S rRNA. 
For example, an analog of human 28S rRNA includes, but is not limited to, a human 28S 
rRNA retropseudogene (see, e.g., Wang et aL, 1997, Gene 196:105-1 11, Accession Number 
L20636). Regions corresponding to domain II or domain V of the 28S rRNA pseudogene 
could be used as target RNAs in the present invention. In a preferred embodiment, the 28S 
rRNA is a human 28S rRNA, although the teachings of the present invention are applicable 
to mammals. 

Synthesis of the target RNAs, i.e., regions of 28S rRNA, can be performed 
by methods known to one of skill in the art (see, e.g., Sambrook etal, 1989, Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York and Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, 
MRL Press, Ltd., Oxford, U.K. Vol. I, II). In a preferred embodiment, the target RNAs are 
cloned as DNAs downstream of a promoter, such as but not limited to T7, T3, or Sp6 
promoters, and in vitro transcribed with the corresponding polymerase. A detectable label 
can be incorporated into the in vitro transcribed RNA or alternatively, the target RNA is 
end-labeled (see Section 5.2 infra). Alternatively, the target RNA can be amplified by 
polymerase chain reaction with a primer containing an RNA promoter and subsequently in 
vitro transcribed, as described in U.S. Patent No. 6,271, 002, which is incorporated by 
reference in its entirety. 



5.2. Detectablv Labeled Target RNAs 
2 0 Target nucleic acids, including but not limited to RNA and DNA, useful in 

the methods of the present invention have a label that is detectable via conventional 
spectroscopic means or radiographic means. Preferably, target nucleic acids are labeled 
with a covalently attached dye molecule. Useful dye-molecule labels include, but are not 
limited to, fluorescent dyes, phosphorescent dyes, ultraviolet dyes, infrared dyes, and visible 
^ dyes. Preferably, the dye is a visible dye. 
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Useful labels in the present invention can include, but are not limited to, 
spectroscopic labels such as fluorescent dyes (e.g., fluorescein and derivatives such as 
fluorescein isothiocyanate (FITC) and Oregon Green™, rhodamine and derivatives (e.g., 

^ Texas red, tetramethylrhodiraine isothiocynate (TRTTC), bora-3a,4a-diaza-s-indacene 
(BODIPY®) and derivatives, etc.), digoxigenin, biotin, phycoerythrin, AMCA, CyDye™, 
and the like), radiolabels (e.g., 3 H, I25 I, 35 S, l4 C, 32 P, 33 P, etc.), enzymes (e.g., horse radish 
peroxidase, alkaline phosphatase etc.), spectroscopic colorimetric labels such as colloidal 
gold or colored glass or plastic (e.g. polystyrene, polypropylene, latex, etc.) beads, or 
nanoparticles - nanoclusters of inorganic ions with defined dimension from 0.1 to 1000 nm. 
Useful affinity tags and complimentary partners include, but are not limited to, 
biotin-streptavidin, complimentary nucleic acid fragments (e.g., oligo dT-oligo dA, oligo 
T-oligo A, oligo dG-oligo dC, oligo G-oligo C), aptamer-streptavidin, or haptens and 
proteins for which antisera or monoclonal antibodies are available. The label may be 

^ coupled directly or indirectly to a component of the detection assay (e.g., the detection 
reagent) according to methods well known in the art. A wide variety of labels may be used, 
with the choice of label depending on sensitivity required, ease of conjugation with the 
compound, stability requirements, available instrumentation, and disposal provisions. 

In one embodiment, nucleic acids that are labeled at one or more specific 
locations are chemically synthesized using phosphoramidite or other solution or solid-phase 
methods. Detailed descriptions of the chemistry used to form polynucleotides by the 
phosphoramidite method are well known (see, e.g., Caruthers et aL, U.S. Pat. Nos. 
4,458,066 and 4,415,732; Caruthers et aL, 1982, Genetic Engineering 4:1-17; Users Manual 
Model 392 and 394 Polynucleotide Synthesizers, 1990, pages 6-1 through 6-22, Applied 

25 Biosystems, Part No. 901237; Ojwang, et al, 1997, Biochemistry, 36:6033-6045). The 
phosphoramidite method of polynucleotide synthesis is the preferred method because of its 
efficient and rapid coupling and the stability of the starting materials. The synthesis is 
performed with the growing polynucleotide chain attached to a solid support, such that 
excess reagents, which are generally in the liquid phase, can be easily removed by washing, 

2 0 decanting, and/or filtration, thereby eliminating the need for purification steps between 
synthesis cycles. 

The following briefly describes illustrative steps of a typical polynucleotide 
synthesis cycle using the phosphoramidite method. First, a solid support to which is 
attached a protected nucleoside monomer at its 3' terminus is treated with acid, e.g., 
^ trichloroacetic acid, to remove the 5'-hydroxyl protecting group, freeing the hydroxyl group 
for a subsequent coupling reaction. After the coupling reaction is completed an activated 
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intermediate is formed by contacting the support-bound nucleoside with a protected 
nucleoside phosphoramidite monomer and a weak acid, e.g., tetrazole. The weak acid 
protonates the nitrogen atom of the phosphoramidite forming a reactive intermediate. 
Nucleoside addition is generally complete within 30 seconds. Next, a capping step is 
performed, which terminates any polynucleotide chains that did not undergo nucleoside 
addition. Capping is preferably performed using acetic anhydride and 1-methylimidazole. 
The phosphite group of the intemucleotide linkage is then converted to the more stable 
phosphotriester by oxidation using iodine as the preferred oxidizing agent and water as the 
oxygen donor. After oxidation, the hydroxyl protecting group of the newly added 
nucleoside is removed with a protic acid, e.g., trichloroacetic acid or dichloroacetic acid, 
and the cycle is repeated one or more times until chain elongation is complete. After 
synthesis, the polynucleotide chain is cleaved from the support using a base, e.g. , 
ammonium hydroxide or t-butyl amine. The cleavage reaction also removes any phosphate 
protecting groups, e.g., cyanoethyl. Finally, the protecting groups on the exocyclic amines 
of the bases and any protecting groups on the dyes are removed by treating the 
polynucleotide solution in base at an elevated temperature, e.g., at about 55°C. Preferably 
the various protecting groups are removed using ammonium hydroxide or t-butyl amine. 



standard phosphoramidite chemistry methods (Hwang et al. 9 1999, Proc. Natl. Acad. Sci. 
USA 96(23):12997-13002; Ojwang etal 9 1997, Biochemistry. 36:6033-6045 and references 
cited therein). Dye molecules useful for covalently coupling to phosphoramidites preferably 
comprise a primary hydroxyl group that is not part of the dye's chromophore. Illustrative 
dye molecules include, but are not limited to, disperse dye CAS 4439-31-0, disperse dye 
CAS 6054-58-6, disperse dye CAS 4392-69-2 (Sigma-Aldrich, St. Louis, MO), disperse 
red, and 1-pyrenebutanol (Molecular Probes, Eugene, OR). Other dyes useful for coupling 
to phosphoramidites will be apparent to those of skill in the art, such as fluorescein, cy3, 
and cy5 fluorescent dyes, and may be purchased from, e.g., Sigma- Aldrich, St. Louis, MO 
or Molecular Probes, Inc., Eugene, OR. 



enzymatically using in vitro transcription (Hwang et al 9 1999, Proc. Natl. Acad. Sci. USA 
96(23): 12997-13002 and references cited therein). In this embodiment, a template DNA is 
denatured by heating to about 90°C and an oligonucleotide primer is annealed to the 
template DNA, for example by slow-cooling the mixture of the denatured template and the 
primer from about 90°C to room temperature. A mixture of ribonucleoside-5 '-triphosphates 
capable of supporting template-directed enzymatic extension of the primed template {e.g., a 



Any of the nucleoside phosphoramidite monomers can be labeled using 



30 



In another embodiment, dye-labeled target RNA molecules are synthesized 
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mixture including GTP, ATP, CTP, and UTP), including one or more dye-labeled 
ribonucleotides (Sigma-Aldrich, St Louis, MO), is added to the primed template. Next, a 
polymerase enzyme is added to the mixture under conditions where the polymerase enzyme 
is active, which are well-known to those skilled in the art. A labeled polynucleotide is 
formed by the incorporation of the labeled ribonucleotides during polymerase-mediated 
strand synthesis. 

In yet another embodiment of the invention, nucleic acid molecules are end- 
labeled after their synthesis. Methods for labeling the 5'-end of an oligonucleotide include 
but are by no means limited to: (i) periodate oxidation of a 5'-to-5'-coupled ribonucleotide, 
followed by reaction with an amine-reactive label (Heller & Morisson, 1985, in Rapid 
Detection and Identification of Infectious Agents, D.T. Kingsbury and S. Falkow, eds., pp. 
245-256, Academic Press); (ii) condensation of ethylenediamine with 5'-phosphorylated 
polynucleotide, followed by reaction with an amine reactive label (Morrison, European 
Patent Application 232 967); (iii) introduction of an aliphatic amine substituent using an 
aminohexyl phosphite reagent in solid-phase DNA synthesis, followed by reaction with an 
amine reactive label (Cardullo et aL 9 1988, Proc. Natl. Acad. Sci. USA 85:8790-8794); and 
(iv) introduction of a thiophosphate group on the 5' -end of the nucleic acid, using 
phosphatase treatment followed by end-labeling with ATP-7S and kinase, which reacts 
specifically and efficiently with maleimide-labeled fluorescent dyes (Czworkowski et aL, 
1991, Biochem. 30:4821-4830). 

A detectable label should not be incorporated into a target nucleic acid at the 
specific binding site at which test compounds are likely to bind, since the presence of a 
covalently attached label might interfere sterically or chemically with the binding of the test 
compounds at this site. Accordingly, if the region of the target nucleic acid that binds to a 
host cell factor is known, a detectable label is preferably incorporated into the nucleic acid 
molecule at one or more positions that are spatially or sequentially remote from the binding 
region. 

After synthesis, the labeled target nucleic acid can be purified using standard 
techniques known to those skilled in the art (see Hwang et al 9 1999, Proc. Natl. Acad. Sci. 
USA 96(23):12997-13002 and references cited therein). Depending on the length of the 
target nucleic acid and the method of its synthesis, such purification techniques include, but 
are not limited to, reverse-phase high-performance liquid chromatography ("reverse-phase 
HPLC"), fast performance liquid chromatography ("FPLC"), and gel purification. After 
purification, the target RNA is refolded into its native conformation, preferably by heating 
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to approximately 85-95°C and slowly cooling to room temperature in a buffer, e.g., a buffer 
comprising about 50 mM Tris-HCl, pH 8 and 100 mM NaCl. 

In another embodiment, the target nucleic acid can also be radiolabeled. A 
radiolabel, such as, but not limited to, an isotope of phosphorus, sulfur, or hydrogen, may be 
incorporated into a nucleotide, which is added either after or during the synthesis of the 
target nucleic acid Methods for the synthesis and purification of radiolabeled nucleic acids 
are well known to one of skill in the art See, e.g., Sambrook et al, 1989, in Molecular 
Cloning: A Laboratory Manual, pp 10.2-10.70, Cold Spring Harbor Laboratory Press, and 
the references cited therein, which are hereby incorporated by reference in their entireties. 

In another embodiment, the target nucleic acid can be attached to an 
inorganic nanoparticle. A nanoparticle is a cluster of ions with controlled size from 0. 1 to 
1 000 nm comprised of metals, metal oxides, or semiconductors including, but not limited to 
Ag 2 S, ZnS, CdS, CdTe, Au, or Ti0 2 . Nanoparticles have unique optical, electronic and 
catalytic properties relative to bulk materials which can be adjusted according to the size of 
the particle. Methods for the attachment of nucleic acids are well known to one of skill in 
the art (see, e.g., Niemeyer, 2001, Angew. Chem. Int. Ed. 40: 4129-4158, International 
Patent Publication WO/021 8643, and the references cited therein, the disclosures of which 
are hereby incorporated by reference in their entireties). 

20 

5.3. Libraries of Small Molecules 

Libraries screened using the methods of the present invention can comprise a 
variety of types of test compounds. In some embodiments, the test compounds are nucleic 
acid or peptide molecules. In a non-limiting example, peptide molecules can exist in a 
phage display library. In other embodiments, types of test compounds include, but are not 
limited to, peptide analogs including peptides comprising non-naturally occurring amino 
acids, e.g., D-amino acids, phosphorous analogs of amino acids, such as a-amino 
phosphoric acids and a-amino phosphoric acids, or amino acids having non-peptide 
linkages, nucleic acid analogs such as phosphorothioates and PNAs, hormones, antigens, 
synthetic or naturally occurring drugs, opiates, dopamine, serotonin, catecholamines, 
thrombin, acetylcholine, prostaglandins, organic molecules, pheromones, adenosine, 
sucrose, glucose, lactose and galactose. Libraries of polypeptides or proteins can also be 
used. 

In a preferred embodiment, the combinatorial libraries are small organic 
^ molecule libraries, such as, but not limited to, benzodiazepines, isoprenoids, 
thiazolidinones, metathiazanones, pyrrolidines, morpholino compounds, and 



25 



30 



-16- 



NY2- 1320351.1 



10 



15 



20 



25 



30 



35 



diazepindiones. In another embodiment, the combinatorial libraries comprise peptoids; 
random bio-oligomers; diversomers such as hydantoins, benzodiazepines and dipeptides; 
vinylogous polypeptides; nonpeptidal peptidomimetics; oligocarbamates; peptidyl 
phosphonates; peptide nucleic acid libraries; antibody libraries; or carbohydrate libraries. 
Combinatorial libraries are themselves commercially available (see, e.g., Advanced 
ChemTech Europe Ltd., Cambridgeshire, UK; ASINEX, Moscow Russia; BioFocus pic, 
Sittingbourne, UK; Bionet Research (A division of Key Organics Limited ), Camelford, 
UK; ChemBridge Corporation, San Diego, California; ChemDiv Inc, San Diego, 
California.; ChemRx Advanced Technologies, South San Francisco, California; ComGenex 
Inc., Budapest, Hungary; Evotec OAI Ltd, Abingdon, UK; IF LAB Ltd., Kiev, Ukraine; 
Maybridge pic, Cornwall, UK; PharmaCore, Inc., North Carolina; SIDDCO Inc, Tucson, 
Arizona; TimTec Inc, Newark, Delaware; Tripos Receptor Research Ltd, Bude, UK; Toslab, 
Ekaterinburg, Russia). 

In one embodiment, the combinatorial compound library for the methods of 
the present invention may be synthesized. There is a great interest in synthetic methods 
directed toward the creation of large collections of small organic compounds, or libraries, 
which could be screened for pharmacological, biological or other activity (Dolle, 2001, J. 
Comb. Chem. 3:477-517; Hall etal, 2001, J. Comb. Chem. 3:125-150; Dolle, 2000, J. 
Comb. Chem. 2:383-433; Dolle, 1999, J. Comb. Chem. 1:235-282). The synthetic methods 
applied to create vast combinatorial libraries are performed in solution or in the solid phase, 
i.e., on a solid support Solid-phase synthesis makes it easier to conduct multi-step 
reactions and to drive reactions to completion with high yields because excess reagents can 
be easily added and washed away after each reaction step. Solid-phase combinatorial 
synthesis also tends to improve isolation, purification and screening. However, the more 
traditional solution phase chemistry supports a wider variety of organic reactions than 
solid-phase chemistry. Methods and strategies for the synthesis of combinatorial libraries 
can be found in A Practical Guide to Combinatorial Chemistry, A.W. Czarnik and S.H. 
Dewitt , eds., American Chemical Society, 1997; The Combinatorial Index, B.A. Bunin, 
Academic Press, 1998; Organic Synthesis on Solid Phase, F.Z. D5rwald, Wiley-VCH, 
2000; and Solid-Phase Organic Syntheses, Vol 1, A.W. Czarnik, ed., Wiley Interscience, 
2001. 

Combinatorial compound libraries of the present invention may be 
synthesized using apparatuses described in US Patent No. 6,358,479 to Frisina et al, U.S. 
Patent No. 6,190,619 to Kilcoin etal, US Patent No. 6,132,686 to Gallup etal, US Patent 
No. 6, 126,904 to Zuellig et al , US Patent No. 6,074,6 1 3 to Harness et al , US Patent No. 
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6,054,100 to Stanchfield etal, and US Patent No. 5,746,982 to Saneii et al. which are 
hereby incorporated by reference in their entirety. These patents describe synthesis 
apparatuses capable of holding a plurality of reaction vessels for parallel synthesis of 
multiple discrete compounds or for combinatorial libraries of compounds. 

In one embodiment, the combinatorial compound library can be synthesized 
in solution. The method disclosed in U.S. Patent No. 6,194,612 to Boger etal, which is 

"i 

hereby incorporated by reference in its entirety, features compounds useful as templates for 
solution phase synthesis of combinatorial libraries. The template is designed to permit 
reaction products to be easily purified from unreacted reactants using liquid/liquid or 
solid/liquid extractions. The compounds produced by combinatorial synthesis using the 
template will preferably be small organic molecules. Some compounds in the library may 
mimic the effects of non-peptides or peptides. In contrast to solid-phase synthesis of 
combinatorial compound libraries, liquid-phase synthesis does not require the use of 
specialized protocols for monitoring the individual steps of a multistep solid-phase 
synthesis (Egner etal, 1995, J.Org. Chem. 60:2652; Anderson et al, 1995, J. Org. Chem. 
60:2650; Fitch et al, 1994, J. Org. Chem. 59:7955; Look et al, 1994, J. Org. Chem. 
49:7588; Metzger et al, 1993, Angew. Chem., Int. Ed. Engl. 32:894; Youngquist et al, 
1994, Rapid Commun. Mass Spect. 8:77; Chu et al, 1995, J. Am. Chem. Soc. 1 17:5419; 
Brummel et al, 1994, Science 264:399; Stevanovic et al, 1993, Bioorg. Med. Chem. Lett. 
3:431). 

Combinatorial compound libraries useful for the methods of the present 
invention can be synthesized on solid supports. In one embodiment, a split synthesis 
method, a protocol of separating and mixing solid supports during the synthesis, is used to 
synthesize a library of compounds on solid supports {see Lam et al, 1997, Chem. Rev. 
97:41-448; Ohlmeyer etal, 1993, Proc. Natl. Acad. Sci. USA 90:10922-10926 and 
references cited therein). Each solid support in the final library has substantially one type of 
test compound attached to its surface. Other methods for synthesizing combinatorial 
libraries on solid supports, wherein one product is attached to each support, will be known 
to those of skill in the art (see, e.g., Nefzi et al., 1997, Chem. Rev. 97:449-472 and US 
Patent No. 6,087,186 to Cargill et al which are hereby incorporated by reference in their 
entirety). 

As used herein, the term "solid support" is not limited to a specific type of 
solid support. Rather a large number of supports are available and are known to one skilled 
in the art. Solid supports include silica gels, resins, derivatized plastic films, glass beads, 
cotton, plastic beads, polystyrene beads, alumina gels, and polysaccharides. A suitable solid 



- 18- 



NY2- 1320351.1 



support may be selected on the basis of desired end use and suitability for various synthetic 
protocols. For example, for peptide synthesis, a solid support can be a resin such as p- 
methylbenzhydiylamine (pMBHA) resin (Peptides International, Louisville, KY), 
polystyrenes (e.g., PAM-resin obtained from Bachem Inc., Peninsula Laboratories, etc.), 
including chloromethylpolystyrene, hydroxymethylpolystyrene and 
aminomethylpolystyrene, poly (dimethylacrylamide)-grafied styrene co-divinyl-benzene 
(e.g., POLYHEPE resin, obtained from Aminotech, Canada), polyamide resin (obtained 
from Peninsula Laboratories), polystyrene resin grafted with polyethylene glycol (e.g., 
q TENTAGEL or ARGOGEL, Bayer, Tubingen, Germany) polydimethylacrylamide resin 
(obtained from Milligen/Biosearch, California), or Sepharose (Pharmacia, Sweden). 

In one embodiment, the solid phase support is suitable for in vivo use, i.e. , it 
can serve as a carrier or support for administration of the test compound to a patient (e.g., 
TENTAGEL, Bayer, Tubingen, Germany). In a particular embodiment, the solid support is 

^ palatable and/or orally ingestable. 

In some embodiments of the present invention, compounds can be attached 
to solid supports via linkers. Linkers can be integral and part of the solid support, or they 
may be nonintegral that are either synthesized on the solid support or attached thereto after 
synthesis. Linkers are useful not only for providing points of test compound attachment to 

^ the solid support, but also for allowing different groups of molecules to be cleaved from the 
solid support under different conditions, depending on the nature of the linker. For 
example, linkers can be, inter alia, electrophilically cleaved, nucleophilically cleaved, 
photocleavable, enzymatically cleaved, cleaved by metals, cleaved under reductive 
conditions or cleaved under oxidative conditions. 

2^ In another embodiment, the combinatorial compound libraries can be 

assembled in situ using dynamic combinatorial chemistry as described in European Patent 
Application 1,1 18,359 Al to Lehn; Hue & Nguyen, 2001, Comb. Chem. High Throughput. 
Screen. 4:53-74; Lehn and Eliseev, 2001, Science 291:2331-2332; Cousins etal 2000, 
Curr. Opin. Chem. Biol. 4: 270-279; and Karan & Miller, 2000, Drug. Disc. Today 5:67-75 
which are incorporated by reference in their entirety. 

Dynamic combinatorial chemistry uses non-covalent interaction with a target 
biomolecule, including but not limited to a protein, RNA, or DNA, to favor assembly of the 
most tightly binding molecule that is a combination of constituent subunits present as a 
mixture in the presence of the biomolecule. According to the laws of thermodynamics, 

^ when a collection of molecules is able to combine and recombine at equilibrium through 
reversible chemical reactions in solution, molecules, preferably one molecule, that bind 
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most tightly to a templating biomolecule will be present in greater amount than all other 
possible combinations. The reversible chemical reactions include, but are not limited to, 
imine, acyl-hydrazone, amide, acetal, or ester formation between carbonyl-containing 
compounds and amines, hydrazines, or alcohols; thiol exchange between disulfides; alcohol 
exchange in borate esters; Diels- Alder reactions; thermal- or photoinduced sigmatropic or 
electrocyclic rearrangements; or Michael reactions. 

In the preferred embodiment of this technique, the constituent components of 
the dynamic combinatorial compound library are allowed to combine and reach equilibrium 
in the absence of the target RNA and then incubated in the presence of the target RNA, 
preferably at physiological conditions, until a second equilibrium is reached. The second, 
perturbed, equilibrium (the so-called "templated mixture") can, but need not necessarily, be 
fixed by a further chemical transformation, including but not limited to reduction, oxidation, 
hydrolysis, acidification, or basification, to prevent restoration of the original equilibrium 
when the dynamical combinatorial compound library is separated from the target RNA. 

In the preferred embodiment of this technique, the predominant product or 
products of the templated dynamic combinatorial library can separated from the minor 
products and directly identified. In another embodiment, the identity of the predominant 
product or products can be identified by a deconvolution strategy involving preparation of 
derivative dynamic combinatorial libraries, as described in European Patent Application 
1 , 1 1 8,359 Al , which is incorporated by reference in its entirety, whereby each component 
of the mixture is, preferably one-by-one but possibly group-wise, left out of the mixture and 
the ability of the derivative library mixture at chemical equilibrium to bind the target RNA 
is measured. The components whose removal most greatly reduces the ability of the 
derivative dynamic combinatorial library to bind the target RNA are likely the components 
of the predominant product or products in the original dynamic combinatorial library. 

5.4. Library Screening 

After a target nucleic acid, such as but not limited to RNA or DNA, is 
labeled and a test compound library is synthesized or purchased or both, the labeled target 
nucleic acid is used to screen the library to identify test compounds that bind to the nucleic 
acid. Screening comprises contacting a labeled target nucleic acid with an individual, or 
small group, of the components of the compound library. Preferably, the contacting occurs 
in an aqueous solution, and most preferably, under physiologic conditions. The aqueous 
solution preferably stabilizes the labeled target nucleic acid and prevents denaturation or 
degradation of the nucleic acid without interfering with binding of the test compounds. The 



-20- 



NY2- 1320351.1 



10 



15 



20 



25 



30 



35 



aqueous solution can be similar to the solution in which a complex between the target RNA 
and its corresponding host cell factor (if known) is formed in vitro. For example, TK 
buffer, which is commonly used to form Tat protein-TAR RNA complexes in vitro, can be 
used in the methods of the invention as an aqueous solution to screen a library of test 
compounds for TAR RNA binding compounds. 

The methods of the present invention for screening a library of test 
compounds preferably comprise contacting a test compound with a target nucleic acid in the 
presence of an aqueous solution, the aqueous solution comprising a buffer and a 
combination of salts, preferably approximating or mimicking physiologic conditions. The 
aqueous solution optionally further comprises non-specific nucleic acids, such as, but not 
limited to, DNA; yeast tRNA; salmon sperm DNA; homoribopolymers such as, but not 
limited to, poly IC, polyA, polyU, and polyC; and non-specific RNA. The non-specific 
RNA may be an unlabeled target nucleic acid having a mutation at the binding site, which 
renders the unlabeled nucleic acid incapable of interacting with a test compound at that site. 
For example, if dye-labeled TAR RNA is used to screen a library, unlabeled TAR RNA 
having a mutation in the uracil 23/cytosine 24 bulge region may also be present in the 
aqueous solution. Without being bound by any theory, the addition of unlabeled RNA that 
is essentially identical to the dye-labeled target RNA except for a mutation at the binding 
site might minimize interactions of other regions of the dye-labeled target RNA with test 
compounds or with the solid support and prevent false positive results. 

The solution further comprises a buffer, a combination of salts, and 
optionally, a detergent or a surfactant. The pH of the solution typically ranges from about 5 
to about 8, preferably from about 6 to about 8, most preferably from about 6.5 to about 8. A 
variety of buffers may be used to achieve the desired pH. Suitable buffers include, but are 
not limited to, Tris, Mes, Bis-Tris, Ada, Aces, Pipes, Mopso, Bis-Tris propane, Bes, Mops, 
Tes, Hepes, Dipso, Mobs, Tapso, Trizma, Heppso, Popso, TEA, Epps, Tricine, Gly-Gly, 
Bicine, and sodium-potassium phosphate. The buffering agent comprises from about 10 
mM to about 100 mM, preferably from about 25 mM to about 75 mM, most preferably from 
about 40 mM to about 60 mM buffering agent. The pH of the aqeuous solution can be 
optimized for different screening reactions, depending on the target RNA used and the types 
of test compounds in the library, and therefore, the type and amount of the buffer used in the 
solution can vary from screen to screen. In a preferred embodiment, the aqueous solution 
has a pH of about 7.4, which can be achieved using about 50 mM Tris buffer. 

In addition to an appropriate buffer, the aqueous solution further comprises a 
combination of salts, from about 0 mM to about 100 mM KC1, from about 0 mM to about 1 
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M NaCl, and from about 0 mM to about 200 mM MgCl 2 . In a preferred embodiment, the 
combination of salts is about 100 mM KC1, 500 mM NaCl, and 10 mM MgCl 2 . Without 
being bound by any theory, Applicant has found that a combination of KC1, NaCl, and 
MgCl 2 stabilizes the target RNA such that most of the RNA is not denatured or digested 
over the course of the screening reaction. The optional concentration of each salt used in 
the aqueous solution is dependent on the particular target RNA used and can be determined 

i 

using routine experimentation. 

The solution optionally comprises from about 0.01% to about 0.5% (w/v) of 
a detergent or a surfactant. Without being bound by any theory, a small amount of detergent 
or surfactant in the solution might reduce non-specific binding of the target RNA to the 
solid support and control aggregation and increase stability of target RNA molecules. 
Typical detergents useful in the methods of the present invention include, but are not 
limited to, anionic detergents, such as salts of deoxycholic acid, 1-heptanesulfonic acid, N- 
laurylsarcosine, lauryl sulfate, 1 -octane sulfonic acid and taurocholic acid; cationic 
detergents such as benzalkonium chloride, cetylpyridinium, methylbenzethonium chloride, 
and decamethonium bromide; zwitterionic detergents such as CHAPS, CHAPSO, alkyl 
betaines, alkyl amidoalkyl betaines, N-dodecyl-N,N-dimethyl-3-ammonio-l- 
propanesulfonate, and phosphatidylcholine; and non-ionic detergents such as n-decyl a-D- 
glucopyranoside, n-decyl B-D-maltopyranoside, n-dodecyl fl-D-maltoside, n-octyl B-D- 
glucopyranoside, sorbitan esters, n-tetradecyl fi-D-maltoside, octylphenoxy 
polyethoxyethanol (Nonidet P-40), nonylphenoxypolyethoxyethanol (NP-40), and tritons. 
Preferably, the detergent, if present, is a nonionic detergent. Typical surfactants useful in 
the methods of the present invention include, but are not limited to, ammonium lauryl 
sulfate, polyethylene glycols, butyl glucoside, decyl glucoside, Polysorbate 80, lauric acid, 
myristic acid, palmitic acid, potassium palmitate, undecanoic acid, lauryl betaine, and lauryl 
alcohol. More preferably, the detergent, if present, is Triton X-100 and present in an 
amount of about 0.1% (w/v). 

Non-specific binding of a labeled target nucleic acid to test compounds can 
be further minimized by treating the binding reaction with one or more blocking agents. In 
one embodiment, the binding reactions are treated with a blocking agent, e.g., bovine serum 
albumin ("BSA"), before contacting with to the labeled target nucleic acid. In another 
embodiment, the binding reactions are treated sequentially with at least two different 
blocking agents. This blocking step is preferably performed at room temperature for from 
about 0.5 to about 3 hours. In a subsequent step, the reaction mixture is further treated with 
unlabeled RNA having a mutation at the binding site. This blocking step is preferably 
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performed at about 4°C for fiom about 12 hours to about 36 hours before addition of the 
dye-labeled target RNA. Preferably, the solution used in the one or more blocking steps is 
substantially similar to the aqueous solution used to screen the library with the dye-labeled 
target RNA, e.g., in pH and salt concentration. 

Once contacted, the mixture of labeled target nucleic acid and the test 
compound is preferably maintained at 4°C for from about 1 day to about 5 days, preferably 
from about 2 days to about 3 days with constant agitation. To identify the reactions in 
which binding to the labeled target nucleic acid occurred, after the incubation period, bound 
from free compounds are determined using an electrophoretic technique (see Section 5.5.1), 
or any of the methods disclosed in Section 5.5 infra. In another embodiment, the 
complexed target nucleic acid does not need to be separated from the free target nucleic acid 
if a technique (i.e., spectrometry) that diferentiates between bound and unbound target 
nucleic acids is used. 

The methods for identifying small molecules bound to labeled nucleic acid 
will vary with the type of label on the target nucleic acid- For example, if a target RNA is 
labeled with a visible of fluorescent dye, the target RNA complexes are preferably identified 
using a chromatographic technique that separates bound from free target by an 
electrophoretic or size differential technique using individual reactions. The reactions 
corresponding to changes in the migration of the complexed RNA can be cross-referenced 
to the small molecule compound(s) added to said reaction. Alternatively, complexed target 
RNA can be screened en masse and then separated from free target RNA using an 
electrophoretic or size differential technique, the resultant complexed target is then analyzed 
using a mass spectrometric technique. In this fashion the bound small molecule can be 
identified on the basis of its molecular weight. In this reaction a priori knowledge of the 
exact molecular weights of all compounds within the library is known. In another 
embodiment, the test compounds bound to the target nucleic acid may not require separation 
from the unbound target nucleic acid if a technique such as, but not limited to, spectrometry 
is used. 

5.5. Separation Methods for Screening Test Compounds 
Any method that detects an altered physical property of a target nucleic acid 
complexed to a test compound from the unbound target nucleic acid may be used for 
separation of the complexed and non-complexed target nucleic acids. Methods that can be 
utilized for the physical separation of complexed target RNA from unbound target RNA 
include, but are not limited to, electrophoresis, fluorescence spectroscopy, surface plasmon 
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resonance, mass spectrometry, scintillation, proximity assay, structure-activity relationships 
("SAR") by NMR spectroscopy, size exclusion chromatography, affinity chromatography, 
and nanoparticle aggregation. 

5.5.1. Electrophoresis 

Methods for separation of the complex of a target RNA bound to a test 
compound from the unbound RNA comprises any method of electrophoretic separation, 
including but not limited to, denaturing and non-denaturing polyacrylamide gel 
electrophoresis, urea gel electrophoresis, gel filtration, pulsed field gel electrophoresis, two 
dimensional gel electrophoresis, continuous flow electrophoresis, zone electrophoresis, 
agarose gel electrophoresis, and capillary electrophoresis. 

In a preferred embodiment, an automated electrophoretic system comprising 
a capillary cartridge having a plurality of capillary tubes is used for high-throughput 
screening of test compounds bound to target RNA. Such an apparatus for performing 
automated capillary gel electrophoresis is disclosed in U.S. Patent Nos. 5,885,430; 
5,916,428; 6,027,627; and 6,063,251, the disclosures of which are incorporated by reference 
in their entireties. 

The device disclosed in U.S. Patent No. 5,885,430, which is incorporated by 
reference in its entirety, allows one to simultaneously introduce samples into a plurality of 
capillary tubes directly from microtiter trays having a standard size. U.S. Patent No. 
5,885,430 discloses a disposable capillary cartridge which can be cleaned between 
electrophoresis runs, the cartridge having a plurality of capillary tubes. A first end of each 
capillary tube is retained in a mounting plate, the first ends collectively forming an array in 
the mounting plate. The spacing between the first ends corresponds to the spacing between 
the centers of the wells of a microtiter tray having a standard size. Thus, the first ends of 
the capillary tubes can simultaneously be dipped into the samples present in the tray's wells. 
The cartridge is provided with a second mounting plate in which the second ends of the 
capillary tubes are retained. The second ends of the capillary tubes are arranged in an array 
which corresponds to the wells in the microtiter tray, which allows for each capillary tube to 
be isolated from its neighbors and therefore free from cross-contamination, as each end is 
dipped into an individual well. 

Plate holes may be provided in each mounting plate and the capillary tubes 
inserted through these plate holes. In such a case, the plate holes are sealed airtight so that 
the side of the mounting plate having the exposed capillary ends can be pressurized. 
Application of a positive pressure in the vicinity of the capillary openings in this mounting 
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plate allows for the introduction of air and fluids during electrophoretic operations and also 
can be used to force out gel and other materials from the capillary tubes during 
reconditioning. The capillary tubes may be protected from damage using a needle 
comprising a cannula and/or plastic tubes, and the like when they are placed in these plate 
holes. When metallic cannula or the like are used, they can serve as electrical contacts for 
current flow during electrophoresis. In the presence of a second mounting plate, the second 
mounting plate is provided with plate holes through which the second ends of the capillary 
tubes project. In this instance, the second mounting plate serves as a pressure containment 
member of a pressure cell and the second ends of the capillary tubes communicate with an 
internal cavity of the pressure cell. The pressure cell is also formed with an inlet and an 
outlet. Gels, buffer solutions, cleaning agents, and the like may be introduced into the 
internal cavity through the inlet, and each of these can simultaneously enter the second ends 
of the capillaries. 

In another preferred embodiment, the automated electrophoretic system can 
comprise a chip system consisting of complex designs of interconnected channels that 
perform and analyze enzyme reactions using part of a channel design as a tiny, continuously 
operating electrophoresis material, where reactions with one sample are going on in one 
area of the chip while electrophoretic separation of the products of another sample is taking 
place in a different part of the chip. Such a system is disclosed in U.S. Patent Nos. 
5,699,157; 5,842,787; 5,869,004; 5,876,675; 5,942,443; 5,948,227; 6,042,709; 6,042,710; 
6,046,056; 6,048,498; 6,086,740; 6,132,685; 6,150,119; 6,150,180; 6,153,073; 6,167,910; 
6,171,850; and 6,186,660, the disclosures of which are incorporated by reference in their 
entireties. 

The system disclosed in U.S. Patent No. 5,699,157, which is hereby 
incorporated by reference in its entirety, provides for a microfluidic system for high-speed 
electrophoretic analysis of subject materials for applications in the fields of chemistry, 
biochemistry, biotechnology, molecular biology and numerous other areas. The system has 
a channel in a substrate, a light source and a photoreceptor. The channel holds subject 
materials in solution in an electric field so that the materials move through the channel and 
separate into bands according to species. The light source excites fluorescent light in the 
species bands and the photoreceptor is arranged to receive the fluorescent light from the 
bands. The system further has a means for masking the channel so that the photoreceptor 
can receive the fluorescent light only at periodically spaced regions along the channel. The 
system also has an unit connected to analyze the modulation frequencies of light intensity 
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received by the photoreceptor so that velocities of the bands along the channel are 
determined, which allows the materials to be analyzed. 

The system disclosed in U.S. Patent No. 5,699,157 also provides for a 

5 method of performing high-speed electrophoretic analysis of subject materials, which 
comprises the steps of holding the subject materials in solution in a channel of a 
microfluidic system; subjecting the materials to an electric field so that the subject materials 
move through the channel and separate into species bands; directing light toward the 
channel; receiving light from periodically spaced regions along the channel simultaneously; 

1Q and analyzing the frequencies of light intensity of the received light so that velocities of the 
bands along the channel can be determined for analysis of said materials. The 
determination of the velocity of a species band determines the electrophoretic mobility of 
the species and its identification. 

U.S. Patent No. 5,842,787, which is hereby incorporated by reference in its 

j 5 entirety, is generally directed to devices and systems employ channels having, at least in 
part, depths that are varied over those which have been previously described (such as the 
device disclosed in U.S. Patent No. 5,699,157), wherein said channel depths provide 
numerous beneficial and unexpected results such as but not limited to, a reduction in sample 
perturbation, reduced non-specific sample mixture by diffusion, and increased resolution. 

20 mother embodiment, the electrophoretic method of separation comprises 

polyacrylamide gel electrophoresis. In a preferred embodiment, the polyacrylamide gel 
electrophoresis is non-denaturing, so as to differentiate the mobilities of the target RNA 
bound to a test compound from free target RNA. If the polyacrylamide gel electrophoresis 
is denaturing, then the target RNArtest compound complex must be cross-linked prior to 

25 electrophoresis to prevent the disassociation of the target RNA from the test compound 
during electrophoresis. Such techniques are well known to one of skill in the art. 

In one embodiment of the method, the binding of test compounds to target 
nucleic acid can be detected, preferably in an automated fashion, by gel electrophoretic 
analysis of interference footprinting. RNA can be degraded at specific base sites by 

30 enzymatic methods such as ribonucleases A, U 2 , CL 3 , T„ Phy M, and B. cereus or chemical 
methods such as diethylpyrocarbonate, sodium hydroxide, hydrazine, piperidine formate, 
dimethyl sulfate, 

[2,12-dimethyl-3,7,l l,17-tetraazacyclo[l 1 .3.1]heptadeca-l(17),2,l 1,13,15-pentaenato] 
nickel(II) (NiCR), cobalt(II)chloride, or iron(II) ethylenediaminetetraacetate (Fe-EDTA) as 
35 described for example in Zheng et al % 1999, Biochem. 37:2207-2214; Latham & Cech, 
1989, Science 245:276-282; and Sambrook etal, 2001, in Molecular Cloning: A 
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Laboratory Manual, pp 12.61-12.73, Cold Spring Harbor Laboratory Press, and the 
references cited therein, which are hereby incorporated by reference in their entireties. The 
specific pattern of cleavage sites is determined by the accessibility of particular bases to the 
reagent employed to initiate cleavage and, as such, is therefore is determined by the 
three-dimensional structure of the RNA. 

The interaction of small molecules with a target nucleic acid can change the 
accessibility of bases to these cleavage reagents both by causing conformational changes in 
the target nucleic acid or by covering a base at the binding interface. When a test compound 
binds to the nucleic acid and changes the accessibility of bases to cleavage reagents, the 
observed cleavage pattern will change. This method can be used to identify and 
characterize the binding of small molecules to RNA as described, for example, by Prudent 
et al, 1995, J. Am. Chem. Soc. 1 17:10145-10146 and Mei et al 3 1998, Biochem. 
37:14204-14212. 

In the preferred embodiment of this technique, the detectably labeled target 
nucleic acid is incubated with an individual test compound and then subjected to treatment 
with a cleavage reagent, either enzymatic or chemical. The reaction mixture can be 
preferably be examined directly, or treated further to isolate and concentrate the nucleic 
acid. The fragments produced are separated by electrophoresis and the pattern of cleavage 
can be compared to a cleavage reaction performed in the absence of test compound. A 
change in the cleavage pattern directly indicates that the test compound binds to the target 
nucleic acid. Multiple test compounds can be examined both in parallel and serially. 

Other embodiments of electrophoretic separation include, but are not limited 
to urea gel electrophoresis, gel filtration, pulsed field gel electrophoresis, two dimensional 
gel electrophoresis, continuous flow electrophoresis, zone electrophoresis, and agarose gel 
electrophoresis. 



5.5.2. Fluorescence Spectroscopy 
In a preferred embodiment, fluorescence polarization spectroscopy, an 
optical detection method that can differentiate the proportion of a fluorescent molecule that 
is either bound or unbound in solution (e.g., the labeled target nucleic acid of the present 
invention), can be used to read reaction results without electrophoretic separation of the 
samples. Fluorescence polarization spectroscopy can be used to read the reaction results in 
the chip system disclosed in U.S. Patent Nos. 5,699,157; 5,842,787; 5,869,004; 5,876,675; 
5,942,443; 5,948,227; 6,042,709; 6,042,710; 6,046,056; 6,048,498; 6,086,740; 6,132,685; 
6,150,1 19; 6,150,180; 6,153,073; 6,167,910; 6,171,850; and 6,186,660, the disclosures of 
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which are incorporated by reference in their entireties. The application of fluorescence 
polarization spectroscopy to the chip system disclosed in the U.S. Patents listed supra is 
fast, efficient, and well-adapted for high-throughput screening. 

5 In another embodiment, a compound that has an affinity for the target nucleic 

acid of interest can be labeled with a fluorophore to screen for test compounds that bind to 
the target nucleic acid. For example, a pyrene-containing aminoglycoside analog was used 
to accurately monitor antagonist binding to a prokaiyotic 16S rRNA A site (which 
comprises the natural target for aminoglycoside antibiotics) in a screen using a fluorescence 

1Q quenching technique in a 96-well plate format (Hamasaki & Rando, 1998, Anal. Biochem. 
26 1(2): 183-90). 

In another embodiment, fluorescence resonance energy transfer (FRET) can 
be used to screen for test compounds that bind to the target nucleic acid. FRET, a 
characteristic change in fluorescence, occurs when two fluorophores with overlapping 
1 5 emission and excitation wavelength bands are held together in close proximity, such as by a 
binding event In the preferred embodiment, the fluorophore on the target nucleic acid and 
the fluorophore on the test compounds will have overlapping excitation and emission 
spectra such that one fluorophore (the donor) transfers its emission energy to excite the 
other fluorophore (the acceptor). The acceptor preferably emits light of a different 

2Q wavelength upon relaxing to the ground state, or relaxes non-radiatively to quench 

fluorescence. FRET is very sensitive to the distance between the two fluorophores, and 
allows measurement of molecular distances less than 10 nm. For example, U.S, Patent 
6,337,183 to Arenas et al 9 which is incorporated by reference in its entirety, describes a 
screen for compounds that bind RNA that uses FRET to measure the effect of test 

25 compounds on the stability of a target RNA molecule where the target RNA is labeled with 
both fluorescent acceptor and donor molecules and the distance between the two 
fluorophores as determined by FRET provides a measure of the folded structure of the 
RNA. Matsumoto et al (2000, Bioorg. Med. Chem. Lett. 10:1857-1861) describe a system 
where a peptide that binds to HIV-1 TAR RNA is labeled on one end with a fluorescein 

3Q fluorophore and a tetramethylrhodamine on the other end. The conformational change of 
the peptide upon binding to the RNA provided a FRET signal to screen for compounds that 
bound to the TAR RNA. 

In the preferred embodiment, both the target nucleic acid and a compound 
that has an affinity for the target nucleic acid of interest are labeled with fluorophores with 

35 overlapping emission and excitation spectra (donor and acceptor), including but not limited 
to fluorescein and derivatives, rhodamine and derivatives, cyanine dyes and derivatives, 
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bora-3a,4a-diaza-s-indacene (BODIPY®) and derivatives, pyrene, nanoparticles, or 
non-fluorescent quenching molecules. Binding of a labeled test compound to the target 
nucleic acid can be identified by the change in observable fluorescence as a result of FRET. 

^ If the target nucleic acid is labeled with the donor fluorophore, then the test 

compounds is labeled with the acceptor fluorophore. Conversely, if the target nucleic acid 
is labeled with the acceptor fluorophore, then the test compounds is labeled with the donor 
fluorophore. A wide variety of labels may be used, with the choice of label depending on 
sensitivity required, ease of conjugation with the compound, stability requirements, 

1Q available instrumentation, and disposal provisions. The fluorophore on the target nucleic 
acid must be in close proximity to the binding site of the test compounds, but should not be 
incorporated into a target nucleic acid at the specific binding site at which test compounds 
are likely to bind, since the presence of a covalently attached label might interfere sterically 
or chemically with the binding of the test compounds at this site. 

j 5 In yet another embodiment, homogeneous time-resolved fluorescence 

("HTRF") techniques based on time-resolved energy transfer from lanthanide ion 
complexes to a suitable acceptor species can be adapted for high-throughput screening for 
inhibitors of RNA-protein complexes (Hemmila, 1999, J. Biomol. Screening 4:303-307; 
Mathis, 1999, J. Biomol. Screening 4:309-3 13). HTRF is similar to fluorescence resonance 

2Q energy transfer using conventional organic dye pairs, but has several advantages, such as 
increased sensitivity and efficiency, and background elimination (Xavier et aL, 2000, 
Trends Biotechnol. 18(8):349-356). 

Fluorescence spectroscopy has traditionally been used to characterize DNA- 
protein and protein-protein interactions, but fluorescence spectroscopy has not been widely 

25 used to characterize RNA-protein interactions because of an interfering absorption of RNA 
nucleotides with the intrinsic tryptophan fluorescence of proteins (Xavier et al, 2000, 
Trends Biotechnol. 18(8):349-356.). However, fluorescence spectroscopy has been used in 
studying the single tryptophan residue within the arginine-rich RNA-binding domain of Rev 
protein and its interaction with the RRE in a time-resolved fluorescence study (Kwon & 

3Q Carson, 1998, Anal. Biochem. 264:133-140). Thus, in this invention, fluorescence 

spectroscopy is less preferred if the test compounds or peptides or proteins possess intrinsic 
tryptophan fluorescence. However, fluorescence spectroscopy can be used for test 
compounds that do not possess intrinsic fluorescence. 



35 
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5.53. Surface Plasmon Resonance ("SPR»1 
Surface plasmon resonance (SPR) can be used for determining kinetic rate 
constants and equilibrium constants for macromolecular interactions by following the 
association project in "real time" (Schuck, 1997, Annu. Rev. Biophys. BiomoL Struct. 
26:541-566). 

The principle of SPR is summarized by Xavier et al (Trends Biotechnol., 
2000, 18(8):349-356) as follows. Total internal reflection occurs at the boundary between 
two substances of different refractive index. The incident light's electromagnetic field 

jq penetrates beyond the interface as an evanescent wave, which extends a few hundred 

nanometers beyond the surface into the medium. Insertion of a thin gold foil at the interace 
produced SPR owing to the absorption of the energy from the evanescent wave by free 
electron clouds of the metal (plasmons). As a result of this absorbance, there is a drop in 
the intensity of the reflected light at a particular angle of incidence. The evanescent wave 

15 profile depends exquisitely on the refractive index of the medium it probes. Thus, the angle 
at which absorption occurs is very sensitive to the refractive changes in the external 
medium. All proteins and nucleic acids are known to change the refractive index of water 
by a similar amount per unit mass, irrespective of their amino acid or nucleotide 
composition (the refractive index change is different for proteins and nucleic acids). When 

2 Q the protein or nucleic acid content of the layer at the sensor changes, the refractive index 
also changes. Typically, one member of a complex is immobilized in a dextran layer and 
then the other member is introduced into the solution, either in a flow cell (Biacore AB, 
Uppsala, Sweden) or a stirred cuvette (Affinity Sensors, Santa Fe, New Mexico). It has 
been determined that there is a linear correlation between the surface concentration of 

2j protein or nucleic acid and the shift in resonance angle, which can be used to quantitate 
kinetic rate constants and/or the equilibrium constants. 

In the present invention, the target RNA may be immobilized to the sensor 
surface through a streptavidin-biotin linkage, the linkage of which is disclosed by Crouch et 
al. (Methods Mol. BioL, 1999, 1 18:143-160). The RNA is biotinylated either during 

3Q synthesis or post-synthetically via the conversion of the 3* terminal ribonucleoside of the 
RNA into a reactive free amino group or using a T7 polymerase incorporated guanosine 
monophosphorothioate at the 5' end. SPR has been used to determine the stoichiometry and 
affinity of the interaction between the HIV Rev protein and the RRE (Van Ryk & 
Venkatesan, 1999, J. Biol. Chem. 274:17452-17463) and the aminoglycoside antibiotics 

35 with RRE and a model RNA derived from the 16S ribosomal A site, respectively (Hendrix 
etaL, 1997, J. Am. Chem. Soc. 119:3641-3648; Wong etal, 1998, Chem. BioL 5:397-406). 
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In one embodiment of the present invention, the target nucleic acid can be 
immobilized to a sensor surface (e.g., by a streptavidin-biotin linkage) and SPR can be used 
to (a) determine whether the target RNA binds a test compound and (b) further characterize 
5 the binding of the target nucleic acids of the present invention to a test compound. 

5.5.4. Mass Spectrometry 

An automated method for analyzing mass spectrometer data which can 
analyze complex mixtures containing many thousands of components and can correct for 

1Q background noise, multiply charged peaks and atomic isotope peaks is described in U.S. 
Patent No. 6,147,344, which is hereby incorporated by reference in its entirety. The system 
disclosed in U.S. Patent No. 6,147,344 is a method for analyzing mass spectrometer data in 
which a control sample measurement is performed providing a background noise check. 
The peak height and width values at each m/z ratio as a function of time are stored in a 

^ memory. A mass spectrometer operation on a material to be analyzed is performed and the 
peak height and width values at each m/z ratio versus time are stored in a second memory 
location. The mass spectrometer operation on the material to be analyzed is repeated a 
fixed number of times and the stored control sample values at each m/z ratio level at each 
time increment are subtracted from each corresponding one from the operational runs, thus 

2Q producing a difference value at each mass ratio for each of the multiple runs at each time 
increment. If the MS value minus the background noise does not exceed a preset value, the 
m/z ratio data point is not recorded, thus eliminating background noise, chemical noise and 
false positive peaks from the mass spectrometer data. The stored data for each of the 
multiple runs is then compared to a predetermined value at each m/z ratio and the resultant 

25 series of peaks, which are now determined to be above the background, is stored in the m/z 
points in which the peaks are of significance. 

One possibility for the utilization of mass spectrometry in high throughput 
screening is the integration of SPR with mass spectrometry. Approaches that have been 
tried are direct analysis of the analyte retained on the sensor chip and mass spectrometry 

3Q with the eluted analyte (Sonksen et al , 1 998, Anal. Chem. 70:273 1 -2736; Nelson & Krone, 
1999, J. Mol. Recog. 12:77-93). Further developments, especially in the interfacing of the 
sensor chip with the mass spectrometer and in reusing the sensor chip, are required to make 
SPR combined with mass spectroscopy a high-throughput method for biomolecular 
interaction analysis and the screening of targets for small molecule inhibitors (Xavier et al 9 
35 2000, Trends Biotechnol. 18(8):349-356). 
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In one embodiment of the present invention, the target nucleic acid 
complexed to a test compound can be determined by any of the mass spectrometry 
processed described supra. Furthermore, mass spectrometry can also be used to elucidate 
the structure of the test compound. 



5.5.5. Scintillation Proximity Assay f "SPA") 
Scintillation proximity assay ("SPA") is a method that can be used for 
screening small molecules that bind to the target RNAs. SPA would involve radiolabeling 
either the target RNA or the test compound and then quantitating its binding to the other 
member to a bead or a surface impregnated with a scintillant (Cook, 1996, Drug Discov. 
Today 1 :287-294). Currently, fluorescence-based techniques are preferred for high- 
throughput screening (Pope et aL, 1999, Drug Discov. Today 4:350-362). 

Screening for small molecules that inhibit Tat peptide:TAR RNA interaction 
has been performed with SPA, and inhibitors of the interaction were isolated and 
characterized (Mei et aL, 1997, Bioorg. Med. Chem. 5:1 173-1 184; Mei et aL, 1998, 
Biochemistry 37:14204-14212). A similar approach can be used to identify small molecules 
that directly bind to a preselected target RNA element in accordance with the invention can 
be utilized. 

SPA can be adapted to high throughput screening by the availability of 
microplates, wherein the scintillant is directly incorporated into the plastic of the microtiter 
wells (Nakayama et aL, 1998, J. Biomol. Screening 3:43-48). Thus, one embodiment of the 
present invention comprises (a) labeling of the target nucleic acid with a radioactive or 
fluorescent label; (b) contacted the labeled nucleic acid with test compounds, wherein each 
test compound is in a microtiter well coated with scintillant and is tethered to the microtiter 
well; and (c) identifying and quantifying the test compounds bound to the target nucleic acid 
with SPA, wherein the test compound is identified by virtue of its location in the 
microplate. 



5.5.6. Structure-Activity Relationships ("SAR») by NMR Spectroscopy 

NMR spectroscopy is a valuable technique for identifying complexed target 
nucleic acids by qualitatively determining changes in chemical shift, specifically from 
distances measured using relaxation effects, and NMR-based approaches have been used in 
the identification of small molecule binders of protein drug targets (Xavier et aL, 2000, 
Trends BiotechnoL 18(8):349-356). The determination of structure-activity relationships 
("S AR") by NMR is the first method for NMR described in which small molecules that 
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bind adjacent subsites are identified by two-dimentional ! H- ,5 N spectra of the target protein 
(Shukere/fl/., 1996, Science 274:1531-1534). The signal from the bound molecule is 
monitored by employing line broadening, transferred NOEs and pulsed field gradient 
diffusion measurements (Moore, 1999, Cuit. Opin. BiotechnoL 10:54-58). A strategy for 
lead generation by NMR using a library of small molecules has been recently described 
(Fejzo et al. y 1999, Chem. Biol. 6:755-769). 



complexed to a test compound can be determined by SAR by NMR. Furthermore, SAR by 
NMR can also be used to elucidate the structure of the test compound. 



chromatography is used to purify test compounds that are bound to a target nucleic from a 
complex mixture of compounds. Size-exclusion chromatography separates molecules based 
on their size and uses gel-based media comprised of beads with specific size distributions. 
When applied to a column, this media settles into a tightly packed matrix and forms a 
complex array of pores. Separation is accomplished by the inclusion or exclusion of 
molecules by these pores based on molecular size. Small molecules are included into the 
20 pores an( *» consequently, their migration through the matrix is retarded due to the added 
distance they must travel before elution. Large molecules are excluded from the pores and 
migrate with the void volume when applied to the matrix. In the present invention, a target 
nucleic acid is incubated with a mixture of test compounds while free in solution and 
allowed to reach equilibrium. When applied to a size exclusion column, test compounds 
2^ free in solution are retained by the column, and test compounds bound to the target nucleic 
acid are passed through the column. In a preferred embodiment, spin columns commonly 
used for "desalting" of nucleic acids will be employed to separate bound from unbound test 
compounds (e.g., Bio-Spin columns manufactured by BIO-RAD). In another embodiment, 
the size exclusion matrix is packed into multiwell plates to allow high throughput 
30 separation of mixtures (e.g., PLASMID 96-well SEC plates manufactured by Millipore). 

5.5.8. Affinity Chromatography 

In one embodiment of the present invention, affinity capture is used to purify 
test compounds that are bound to a target nucleic acid labeled with an affinity tag from a 
35 complex mixture of compounds. To accomplish this, a target nucleic acid labeled with an 
affinity tag is incubated with a mixture of test compounds while free in solution and then 



In one embodiment of the present invention, the target nucleic acid 



5.5.7, Size Exclusion Chromatography 

In another embodiment of the present invention, size-exclusion 
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captured to a solid support once equilibrium has been established; alternatively, target 
nucleic acids labeled with an affinity tag can be captured to a solid support first and then 
allowed to reach equilibrium with a mixture of test compounds. 

5 The solid support is typically comprised of, but not limited to, cross-linked 

agarose beads that are coupled with a ligand for the affinity tag. Alternatively, the solid 
support may be a glass, silicon, metal, or carbon, plastic (polystyrene, polypropylene) 
surface with or without a self-assembled monolayer (SAM) either with a covalently attached 
ligand for the affinity tag, or with inherent affinity for the tag on the target nucleic acid 

j 0 Once the complex between the target nucleic acid and test compound has 

reached equilibrium and has been captured, one skilled in the art will appreciate that the 
retention of bound compounds and removal of unbound compounds is facilitated by 
washing the solid support with large excesses of binding reaction buffer. Furthermore, 
retention of high affinity compounds and removal of low affinity compounds can be 

^ accomplished by a number of means that increase the stringency of washing; these means 
include, but are not limited to, increasing the number and duration of washes, raising the 
salt concentration of the wash buffer, addition of detergent or surfactant to the wash buffer, 
and addition of non-specific competitor to the wash buffer. 

In one embodiment, the test compounds themselves are detectably labeled 

2Q with fluorescent dyes, radioactive isotopes, or nanoparticles. When the test compounds are 
applied to the captured target nucleic acid in a spatially addressed fashion (e.g., in separate 
wells of a 96-well microplate), binding between the test compounds and the target nucleic 
acid can be determined by the presence of the detectable label on the test compound using 
fluorescence. 

25 Following the removal of unbound compounds, bound compounds with high 

affinity for the target nucleic acid can be eluted from the immobilized target nucleic acids 
and analyzed. The elution of test compounds can be accomplished by any means that break 
the non-covalent interactions between the target nucleic acid and compound. Means for 
elution include, but are not limited to, changing the pH, changing the salt concentration, the 

3Q application of organic solvents, and the application of molecules that compete with the 
bound ligand. In a preferred embodiment, the means employed for elution will release the 
compound from the target RNA, but will not effect the interaction between the affinity tag 
and the solid support, thereby achieving selective elution of test compound. Moreover, a 
preferred embodiment will employ an elution buffer that is volatile to allow for subsequent 

35 concentration by lyophilization of the eluted compound (e.g., 0 M to 5 M ammonium 
acetate). 
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5.5.9. Nanoparticle Aggregation 

In one embodiment of the pTesent invention, both the target nucleic acid and 
the test compounds are labeled with nanoparticles. A nanoparticle is a cluster of ions with 

5 controlled size from 0.1 to 1000 nm comprised of metals, metal oxides, or semiconductors 
including, but not limited to Ag 2 S, ZnS, CdS, CdTe, Au, or Ti0 2 . Methods for the 
attachment of nucleic acids and small molecules to nanoparticles are well know to one of 
skill in the art (reviewed in Niemeyer, 2001, Angew. Chem. Int. Ed. 40:4129-4158. The 
references cited therein are hereby incorporated by reference in their entireties). In 

j q particular, if multiple copies of the target nucleic acid are attached to a single nanoparticle 
and multiple copies of a test compound are attached to another nanoparticle, then interaction 
between the test compound and target nucleic acid will induce aggregation of nanoparticles 
as described, for example, by Mitchel et al 1999, J. Am. Chem. Soc. 121 :8122-8123. The 
aggregate can be detected by changes in absorbance or fluorescence spectra and physically 

j ^ separated from the unbound components through filtration or centrifugation. 

5.6. Methods for Identifying or Characterizing the Test 
Compounds Bound to the Target Nucleic Acids 

If the library comprises arrays or microarrays of test compounds, wherein 

each test compound has an address or identifier, the test compound can be deconvoluted, 

e.g., by cross-referencing the positive sample to original compound list that was applied to 

the individual test assays. 

If the library is a peptide or nucleic acid library, the sequence of the test 

compound can be determined by direct sequencing of the peptide or nucleic acid. Such 

methods are well known to one of skill in the art. 

A number of physico-chemical techniques can be used for the de novo 

characterization of test compounds bound to the target. 



20 



25 



30 



35 



5.6.1. Mass Spectrometry 

Mass spectrometry (e.g., electrospray ionization ("ESI") and matrix-assisted 
laser desorption-ionization ("MALDI"), Fourier-transform ion cyclotron resonance ("FT- 
ICR")) can be used both for high-throughput screening of test compounds that bind to a 
target RNA and elucidating the structure of the test compound. Thus, one example of mass 
spectroscopy is that separation of a bound and unbound complex and test compound 
structure elucidation can be carried out in a single step. 



-35- 



NY2- 1320351.1 



MALDI uses a pulsed laser for desorption of the ions and a time-of-flight 
analyzer, and has been used for the detection of noncovalent tRN A: amino-acyl-tRNA 
synthetase complexes (Gruic-Sovulj etai, 1997, J. Biol. Chem. 272:32084-32091). 
However, covalent cross-linking between the target nucleic acid and the test compound is 
required for detection, since a non-covalently bound complex may dissociate during the 
MALDI process. 

ESI mass spectrometry ("ESI-MS") has been of greater utility for studying 
non-covalent molecular interactions because, unlike the MALDI process, ESI-MS generates 

2 q molecular ions with little to no fragmentation (Xavier et al , 2000, Trends Biotechnol. 

1 8(8):349-356). ESI-MS has been used to study the complexes formed by HIV Tat peptide 
and protein with the TAR RN A (Sannes-Lowery et al, 1997, Anal. Chem. 69:5130-5135). 

Fourier-transform ion cyclotron resonance ("FT-ICR") mass spectrometry 
provides high-resolution spectra, isotope-resolved precursor ion selection, and accurate 

15 mass assignments (Xavier et ai 9 2000, Trends Biotechnol. 18(8):349-356). FT-ICR has 
been used to study the interaction of aminoglycoside antibiotics with cognate and non- 
cognate RNAs (Hofstadler et ah, 1999, Anal. Chem. 71 :3436-3440; Griffey et al, 1999, 
Proc. Natl. Acad. Sci. USA 96: 10129-10133). As true for all of the mass spectrometry 
methods discussed herein, FT-ICR does not require labeling of the target RNA or a test 

2Q compound. 

An advantage of mass spectroscopy is not only the elucidation of the 
structure of the test compound, but also the determination of the structure of the test 
compound bound to the preselected target RNA. Such information can enable the discovery 
of a consensus structure of a test compound that specifically binds to a preselected target 

25 ™ A - 



5,6.2. NMR Spectroscopy 

As described above, NMR spectroscopy is a technique for identifying 
binding sites in target nucleic acids by qualitatively determining changes in chemical shift, 

30 specifically from distances measured using relaxation effects. Examples of NMR that can 
be used for the invention include, but are not limited to, one-dimentional NMR, two- 
dimentional NMR, correlation spectroscopy ("COSY"), and nuclear Overhauser effect 
("NOE") spectroscopy. Such methods of structure determination of test compounds are 
well known to one of skill in the art. 

35 Similar to mass spectroscopy, an advantage of NMR is the not only the 

elucidation of the structure of the test compound, but also the determination of the structure 
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of the test compound bound to the preselected target RNA. Such information can enable the 
discovery of a consensus structure of a test compound that specifically binds to a 
preselected target RNA. 

5 

5.6.3. Vibrational Spectroscopy 

Vibrational spectroscopy (e.g. infrared (IR) spectroscopy or Raman • 
spectroscopy) can be used for elucidating the structure of the test compound on the isolated 
bead. 

j q Infrared spectroscopy measures the frequencies of infrared light 

(wavelengths from 100 to 10,000 nm) absorbed by the test compound as a result of 
excitation of vibrational modes according to quantum mechanical selection rules which 
require that absorption of light cause a change in the electric dipole moment of the 
molecule. The infrared spectrum of any molecule is a unique pattern of absorption 
15 wavelengths of varying intensity that can be considered as a molecular fingerprint to 
identify any compound. 

Infrared spectra can be measured in a scanning mode by measuring the 
absorption of individual frequencies of light, produced by a grating which separates 
frequencies from a mixed-frequency infrared light source, by the test compound relative to a 

2Q standard intensity (double-beam instrument) or pre-measured ('blank') intensity 

(single-beam instrument). In a preferred embodiment, infrared spectra are measured in a 
pulsed mode (FT-IR) where a mixed beam, produced by an interferometer, of all infrared 
light frequencies is passed through or reflected off the test compound. The resulting 
interferogram, which may or may not be added with the resulting interferograms from 

25 subsequent pulses to increase the signal strength while averaging random noise in the 

electronic signal, is mathematically transformed into a spectrum using Fourier Transform or 
Fast Fourier Transform algorithms. 

Raman spectroscopy measures the difference in frequency due to absorption 
of infrared frequencies of scattered visible or ultraviolet light relative to the incident beam. 

3Q The incident monochromatic light beam, usually a single laser frequency, is not truly 

absorbed by the test compound but interacts with the electric field transiently. Most of the 
light scattered off the sample with be unchanged (Rayleigh scattering) but a portion of the 
scatter light will have frequencies that are the sum or difference of the incident and 
molecular vibrational frequencies. The selection rules for Raman (inelastic) scattering 

35 require a change in polarizability of the molecule. While some vibrational transitions are 
observable in both infrared and Raman spectrometry, must are observable only with one or 
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the other technique. The Raman spectrum of any molecule is a unique pattern of absorption 
wavelengths of varying intensity that can be considered as a molecular fingerprint to 
identify any compound. 

Raman spectra are measured by submitting monochromatic light to the 
sample, either passed through or preferably reflected off, filtering the Rayleigh scattered 
light, and detecting the frequency of the Raman scattered light. An improved Raman 
spectrometer is described in US Patent No. 5,786,893 to Fink et al y which is hereby 
incorporated by reference. 



address single beads by integration of a visible microscope and spectrometer. A 
microscopic infrared spectrometer is described in U.S. Patent No. 5,581,085 to Refiner et 
dl , which is hereby incorporated by reference in its entirety. An instrument that 
simultaneously performs a microscopic infrared and microscopic Raman analysis on a 
sample is described in U.S. Patent No. 5,841,139 to Sostek et ah, which is hereby 
incorporated by reference in its entirety. 



the IR or Raman spectra of a test compound to a dataset of vibrational (IR or Raman) 
spectra previously acquired for each compound in the combinatorial library. By this 
method, the spectra of compounds with known structure are recorded so that comparison 
with these spectra can identify compounds again when isolated from RNA binding 
experiments. 



used to elucidate the structure of the test compound. For example, as described in U.S. 
Patent Nos. 6,395,480; 6,376,258; 6,368,795; 6,340,568; 6,338,968; 6,287,874; and 
6,287,776 to Hefti, the disclosures of which are hereby incorporated by reference, the 
unique dielectric properties of molecules and binding complexes, such as hybridization 
complexes formed between a nucleic acid probe and a nucleic acid target, molecular 
binding events, and protein/ligand complexes, result in varying microwave spectra which 
can be measured. The molecule's dielectric properties can be observed by coupling a test 
signal to the molecule and observing the resulting signal. When the test signal excites the 
molecule at a frequency within the molecule's dispersion regime, especially at a resonant 
frequency, the molecule will interact strongly with the signal, and the resulting signal will 
exhibit dramatic variations in its measured amplitude and phase, thereby generating a 



Vibrational microscopy can be measured in a spatially resolved fashion to 



In the preferred embodiment, test compounds can be identified by matching 



5.6*4. Microwave Spectroscopy 

In another embodiment, the microwave spectra of a test compound, can be 
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unique signal response. This response can be used to detect and identify the bound 
molecular structure. In addition, because most molecules will exhibit different dispersion 
properties over the same or different frequency bands, each generates a unique signal 
response which can be used to identify the molecular structure. 

5.7. Secondary Biological Screens 

The test compounds identified in the nonsense suppression assay (for 
convenience referred to herein as a "lead" compound) can be tested for biological activity 
using an in vitro transcribed RNA from the gene with the premature translation termination 
codon and in vitro translating that RNA in a cell-free translation extract. The activity of the 
lead compound in the in vitro translation mixture can be determined by any method that 
measures increased expression of the full-length gene, Le. 9 past the premature termination 
codon. For example, expression of a functional protein from the full-length gene (e.g. , a 
reporter gene) can be measured to determine the effect of the lead compound on premature 
translation termination and/or nonsense-mediated mRNA decay in an in vitro system. 

In addition, the lead compound can be tested in a host cell engineered to 
contain the RNA with the premature translation termination codon controlling the 
expression of a reporter gene. In this example, the lead compounds are assayed in the 
presence or absence of the RNA with the premature translation termination codon. 
Compounds that modulate premature translation termination and/or nonsense-mediated 
mRNA decay in vivo will result in increased expression of the full-length gene, Le., past the 
premature termination codon. Alternatively, a phenotypic or physiological readout can be 
used to assess activity of the target RNA with the premature translation termination codon 
in the presence and absence of the lead compound. Both the in vitro and in vivo nonsense 
suppression assays used herein and as described in International Patent Publication WO 
01/44516, which is incorporated by reference in its entirety, can be used to identify lead 
compounds can also be used to determine an EC 50 for the lead compounds. 

The test compounds identified in the nonsense suppression assay (for 
convenience referred to herein as a "lead" compound) can be tested for biological activity 
using host cells containing or engineered to contain the target RNA element coupled to a 
functional readout system. For example, the lead compound can be tested in a host cell 
engineered to contain the RNA with the premature translation termination codon controlling 
the expression of a reporter gene. In this example, the lead compounds are assayed in the 
presence or absence of the RNA with the premature translation termination codon. 
Compounds that modulate premature translation termination and/or nonsense-mediated 
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mRNA decay in vivo will result in increased expression of the full-length gene, Le., past the 
premature termination codon. Alternatively, a phenotypic or physiological readout can be 
used to assess activity of the target RNA with the premature translation termination codon 
in the presence and absence of the lead compound. Both the in vitro and in vivo nonsense 
suppression assays used herein and as described in International Patent Publication WO 
01/44516, which is incorporated by reference in its entirety, can be used to identify lead 
compounds can also be used to determine an EC 50 for the lead compounds. 

Animal model systems can also be used to demonstrate the safety and 
efficacy of the lead compounds identified in the nonsense suppression assays described 
above. The lead compounds identified in the nonsense suppression assay can then be tested 
for biological activity using animal models for a disease, condition, or syndrome of interest. 
These include animals engineered to contain the target RNA element coupled to a functional 
readout system, such as a transgenic mouse. 

Examples of animal models for cystic fibrosis include, but are not limited to, 
cftr(-/-) mice (see, e.g, Freedman et al, 2001, Gastroenterology 121(4):950-7), 
cftr(tmlHGU/tmlHGU) mice (see, e.g., Bernhard et al, 2001, Exp Lung Res 27(4):349- 
66), CFTR-deficient mice with defective cAMP-mediated Cl(-) conductance (see, e.g., 
Stotland et al, 2000, Pediatr Pulmonol 30(5):4 13-24), and C57BL/6- 
Cftr(mlUNC)/Cftr(mlUNC) knockout mice (see, e.g., Stotland etal, 2000, Pediatr 
Pulmonol 30(5):413-24). 

Examples of animal models for muscular dystrophy include, but are not 
limited to, mouse, hamster, cat, dog, and C. elegans. Examples of mouse models for 
muscular dystrophy include, but are not limited to, the dy-A mouse (see, e.g., Connolly et 
al, 2002, J Neuroimmunol 127(l-2):80-7), a muscular dystrophy with myositis (mdm) 
mouse mutation (see, e.g, Garvey et al, 2002, Genomics 79(2): 146-9), the mdx mouse 
(see, e.g., Nakamura et al, 2001 , Neuromuscul Disord 1 1(3);251 -9), the utrophin- 
dystrophin knockout (dko) mouse (see, e.g, Nakamura et al, 2001, Neuromuscul Disord 
1 1(3):251-9), the dy/dy mouse (see, e.g, Dubowitz et al, 2000, Neuromuscul Disord 10(4- 
5):292-8), the mdx(Cv3) mouse model (see, e.g., Pillers et al, 1999, Laryngoscope 
109(8): 13 10-2), and the myotonic ADR-MDX mutant mice (see, e.g, Kramer et al, 1998, 
Neuromuscul Disord 8(8):542-50). Examples of hamster models for muscular dystrophy 
include, but are not limited to, sarcoglycan-deficient hamsters (see, e.g., Nakamura et al, 
2001, Am J Physiol Cell Physiol 281(2):C690-9) and the BIO 14.6 dystrophic hamster (see, 
e.g., Schlenker & Burbach, 1991, J Appl Physiol 71 (5): 1655-62). An example of a feline 
model for muscular dystrophy includes, but is not limited to, the hypertrophic feline 
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muscular dystrophy model (see, e.g., Gaschen & Burgunder, 2001, ActaNeuropathol (Berl) 
1 01(6):591 -600). Canine models for muscular dystrophy include, but are not limited to, 
golden retriever muscular dystrophy (see, e.g., Fletcher et al, 2001, Neuromuscul Disord 
1 l(3):239-43) and canine X-linked muscular dystrophy (see, e.g., Valentine et at, 1992, 
Am J Med Genet 42(3) :3 52-6). Examples of C. elegans models for muscular dystrophy are 
described in Chamberlain & Benian, 2000, Curr Biol 10(21):R795-7 and Culette & Sattelle, 
2000, Hum Mol Genet 9(6):869-77. 

Examples of animal models for familial hypercholesterolemia include, but 
are not limited to, mice lacking functional LDL receptor genes (see, e.g., Aji et al, 1997, 
Circulation 95(2):430-7), Yoshida rats (see, e.g., Fantappie et al, 1992, Life Sci 
50(24): 191 3-24), the JCR:LA-cp rat (see, e.g., Richardson et al, 1998, Atherosclerosis 
138(l):135-46), swine (see, e.g., Hasler-Rapacz et al, 1998, Am J Med Genet 76(5):379- 
86), and the Watanabe heritable hyperlipidaemic rabbit (see, e.g., Tsutsumi et al, 2000, 
Arzneimittelforschung 50(2): 11 8-21 ; Harsch etal, 1998, Br JPharmacol 124(2):227-82; 
and Tanaka et al, 1995, Atherosclerosis 1 14(l):73-82). 

An example of an animal model for human cancer in general includes, but is 
not limited to, spontaneously occurring tumors of companion animals (see, e.g., Vail & 
MacEwen, 2000, Cancer Invest 18(8):781-92). Examples of animal models for lung cancer 
include, but are not limited to, lung cancer animal models described by Zhang & Roth 
(1994, In Vivo 8(5):755-69) and a transgenic mouse model with disrupted p53 function 
(see, e.g, Morris <tf al, 1998, J La State Med Soc 150(4): 179-85). An example of an 
animal model for breast cancer includes, but is not limited to, a transgenic mouse that 
overexpresses cyclin Dl (see, e.g., Hosokawa et al, 2001, Transgenic Res 10(5):471-8). 
An example of an animal model for colon cancer includes, but is not limited to, a TCRbeta 
and p53 double knockout mouse (see, e.g., Kado et al, 2001, Cancer Res 61(6):2395-8). 
Examples of animal models for pancreatic cancer include, but are not limited to, a 
metastatic model of Panc02 murine pancreatic adenocarcinoma (see, e.g., Wang et al, 2001, 
Int J Pancreatol 29(l):37-46) and nu-nu mice generated in subcutaneous pancreatic tumours 
(see, e.g., Ghaneh et al, 2001, Gene Ther 8(3): 199-208). Examples of animal models for 
non-Hodgkin's lymphoma include, but are not limited to, a severe combined 
immunodeficiency ("SCID") mouse (see, e.g., Bryant et al, 2000, Lab Invest 80(4):553-73) 
and an IgHmu-HOXl 1 transgenic mouse (see, e.g., Hough et al, 1998, Proc Natl Acad Sci 
USA 95(23): 13 853-8). An example of an animal model for esophageal cancer includes, but 
is not limited to, a mouse transgenic for the human papillomavirus type 16 E7 oncogene 
(see, e.g., Herber etal, 1996, J Virol 70(3): 1873-81). Examples of animal models for 
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colorectal carcinomas include, but are not limited to, Ape mouse models (see, e.g., Fodde & 
Smits, 2001, Trends Mol Med 7(8):369-73 and Kuraguchi et ah, 2000, Oncogene 
19(50):5755-63). An example of an animal model for neurofibromatosis includes, but is not 
limited to, mutant NF1 mice (see, e.g. , Cichowski et ah, 1996, Semin Cancer Biol 7(5):29 1 - 
8). Examples of animal models for retinoblastoma include, but are not limited to, 
transgenic mice that expression the simian virus 40 T antigen in the retina (see, e.g., Howes 
et al., 1994, Invest Ophthalmol Vis Sci 35(2):342-51 and Windle et al, 1990, Nature 
343(6259):665-9) and inbred rats (see, e.g., Nishida et al, 1981, Curr Eye Res l(l):53-5 
and Kobayashi et al, 1982, Acta Neuropathol (Bed) 57(2-3):203-8). Examples of animal 
models for Wilm's tumor include, but are not limited to, a WT1 knockout mice (see, e.g., 
Schamhorst et al., 1997, Cell Growth Differ 8(2): 133-43), a rat subline with a high 
incidence of neuphroblastoma (see, e.g., Mesfin & Breech, 1996, Lab Anim Sci 46(3):321- 
6), and a Wistar/Furth rat with Wilms' tumor (see, e.g., Murphy et ah, 1987, Anticancer Res 
15 7(4B):717-9). 

Examples of animal models for retinitis pigmentosa include, but are not 
limited to, the Royal College of Surgeons ("RCS") rat (see, e.g., Vollrath et al., 2001, Proc 
Natl Acad Sci USA 98(22); 12584-9 and Hanitzsch et al., 1998, Acta Anat (Basel) 162(2- 

3) :1 19-26), a rhodopsin knockout mouse (see, e.g., Jaissle et ah, 2001, Invest Ophthalmol 
Vis Sci 42(2):506-13), and Wag/Rij rats (see, e.g, Lai et ah, 1980, Am J Pathol 98(1):281- 

4) . 

Examples of animal models for cirrhosis include, but are not limited to, 
CCl 4 -exposed rats (see, e.g., Kloehn et ah, 2001, Horm Metab Res 33(7):394-401) and 
rodent models instigated by bacterial cell components or colitis (see, e.g., Vierling, 2001, 

25 Best Pract Res Clin Gastroenterol 1 5(4):591-610). 

Examples of animal models for hemophilia include, but are not limited to, 
rodent models for hemophilia A (see, e.g., Reipert et ah, 2000, Thromb Haemost 84(5):826- 
32; Jarvis et ah,. 1996, Thromb Haemost 75(2):3 18-25; and Bi et ah, 1995, Nat Genet 
10(1):1 19-21), canine models for hemophilia A (see, e.g., Gallo-Penn et al, 1999, Hum 

3Q Gene Ther 10(1 1):1791-802 and Connelly et al, 1998, Blood 91(9);3273-81), murine 
models for hemophilia B (see, e.g, Snyder et ah, 1999, Nat Med 5(l):64-70; Wang et ah, 
1997, Proc Natl Acad Sci USA 94(21): 1 1563-6; and Fang et ah, 1996, Gene Ther 3(3):217- 
22), canine models for hemophilia B (see, e.g., Mount et ah, 2002, Blood 99(8):2670-6; 
Snyder et ah, 1999, Nat Med 5(l):64-70; Fang et ah, 1996, Gene Ther 3(3):217-22); and 

35 Kay etah, 1994, Proc Natl Acad Sci USA 91(6):2353-7), and a rhesus macaque model for 
hemophilia B (see, e.g., Lozier et ah, 1999, Blood 93(6):1875-81). 
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Examples of animal models for von Willebrand disease include, but are not 
limited to, an inbred mouse strain RHIS/J (see, e.g., Nichols el al., 1994, 83(1 1):3225-31 
and Sweeney et al, 1990, 76(1 1):2258-65), rats injected with botrocetin (see, e.g., Sanders 

5 et al, 1988, Lab Invest 59(4):443-52), and porcine models for von Willebrand disease (see, 
e.g., Nichols et al, 1995, Proc Natl Acad Sci USA 92(7):2455-9; Johnson & Bowie, 1992, J 
Lab Clin Med 120(4):553-8); and Brinkhous et al, 1991, Mayo Clin Proc 66(7):733-42). 

Examples of animal models for ^-thalassemia include, but are not limited to, 
murine models with mutations in globin genes (see, e.g., Lewis et al, 1998, Blood 

10 91(6):2152-6; Rajaef a/., 1994, Br J Haematol 86(l):156-62; Popp etal, 1985, 445:432-44; 
and Skow era/., 1983, Cell 34(3): 1043-52). 

Examples of animal models for kidney stones include, but are not limited to, 
genetic hypercalciuric rats (see, e.g., Bushinsky et al, 1999, Kidney Int 55(l):234-43 and 
Bushinsky et al, 1995, Kidney Int 48(6):1705-13), chemically treated rats (see, e.g., Grases 

15 et al, 1998, Scand J Urol Nephrol 32(4):261-5; Burgess et al, 1995, Urol Res 23(4):239- 
42; Kumar etal, 1991, JUrol 146(5):1384-9; Okada era/., 1985, Hinyokika Kiyo 
31(4):565-77; and Bluestone et al, 1975, Lab Invest 33(3):273-9), hyperoxaluric rats (see, 
e.g., Jones et al., 1991, J Urol 145(4):868-74), pigs with unilateral retrograde flexible 
nephroscopy (see, e.g., Seifmah et al., 2001, 57(4):832-6), and rabbits with an obstructed 

20 u PP er urinary tract (see, e.g., Itatani et al, 1979, Invest Urol 17(3):234-40). 

Examples of animal models for ataxia-telangiectasia include, but are not 
limited to, murine models of ataxia-telangiectasia (see, e.g. , Barlow et al, 1 999, Proc Natl 
Acad Sci USA 96(17):9915-9 and Inoue etal, 1986, Cancer Res 46(8):3979-82). 

Compounds displaying the desired biological activity can be considered to be 

2^ lead compounds, and will be used in the design of congeners or analogs possessing useful 
pharmacological activity and physiological profiles. Following the identification of a lead 
compound, molecular modeling techniques can be employed, which have proven to be 
useful in conjunction with synthetic efforts, to design variants of the lead that can be more 
effective. These applications may include, but are not limited to, Pharmacophore Modeling 

30 (cf. Lamothe, et al. 1997, J. Med. Chem. 40: 3542; Mottola et al. 1996, J. Med. Chem. 39: 
285; Beusen etal. 1995, Biopolymers 36: 181; P. Fossa etal. 1998, Comput. Aided Mol. 
Des. 12: 361), QSAR development (cf Siddiqui et al. 1999, J. Med. Chem. 42: 4122; 
Barreca et al. 1999 Bioorg. Med. Chem. 7: 2283; Kroemer et al. 1995, J. Med. Chem. 38: 
4917; Schaal et al. 2001, J. Med. Chem. 44: 155; Buolamwini & Assefa 2002, J. Mol. 
35 Chem. 45: 84), Virtual docking and screening/scoring (cf. Anzini et al. 2001, J. Med. Chem. 
44: 1134; Faaland et al. 2000, Biochem. Cell. Biol. 78: 415; Silvestri etal. 2000, Bioorg. 
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Med Chem. 8: 2305; J. Lee et al 2001, Bioorg. Med. Chem. 9: 19), and Structure 
Prediction using RNA structural programs including, but not limited to mFold (as described 
by Zuker et al Algorithms and Thermodynamics for RNA Secondary Structure Prediction: 
A Practical Guide in RNA Biochemistry and Biotechnology pp. 1 1-43, J. Barciszewski & 
B.F.C. Clark, eds. (NATO ASI Series, Kluwer Academic Publishers, 1999) and Mathews et 
at. 1999 J. Mol. Biol. 288: 91 1-940); RNAmotif (Macke et al 2001, Nucleic Acids Res. 29: 
4724-4735; and the Vienna RNA package (Hofacker et al 1994, Monatsh. Chem. 125: 
167-188). 



several review articles, such as Rotivinen et al, 1988, Acta Pharmaceutical Fennica 
97:159-166; Ripka, 1998, New Scientist 54-57; McKinaly & Rossmann, 1989, Annu. Rev. 
Pharmacol. Toxiciol. 29:111-122; Perry & Davies, QSAR: Quantitative Structure-Activity 
Relationships in Drug Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis & Dean, 1989, 
Proc. R. Soc. Lond. 236:125-140 and 141-162; Askew et al., 1989, J. Am. Chem. Soc. 
1 1 1:1082-1090. Molecular modeling tools employed may include those from Tripos, Inc., 
St. Louis, Missouri {e.g., SybylAJNITY, CONCORD, DiverseSolutions), Accelerys, San 
Diego, California (e.g., Catalyst, Wisconsin Package {BLAST, etc.}), Schrodinger, 
Portland, Oregon (e.g., QikProp, QikFit, Jaguar) or other such vendors as BioDesign, Inc. 
(Pasadena, California), Allelix, Inc. (Mississauga, Ontario, Canada), and Hypercube, Inc. 
(Cambridge, Ontario, Canada), and may include privately designed and/or "academic" 
software (e.g. RNAMotif, mFOLD). These application suites and programs include tools 
for the atomistic construction and analysis of structural models for drug-like molecules, 
proteins, and DNA or RNA and their potential interactions. They also provide for the 
calculation of important physical properties, such as solubility estimates, permeability 
metrics, and empirical measures of molecular "draggability" (e.g., Lipinski "Rule of 5" as 
described by Lipinski et al 1997, Adv. Drug Delivery Rev. 23: 3-25). Most importantly, 
they provide appropriate metrics and statistical modeling power (such as the patented 
CoMFA technology in Sybyl as described in US Patents 6,240,374 and 6,185,506) to 
develop Quantitative Structural Activity Relationships (QSARs) which are used to guide the 
synthesis of more efficacious clinical development candidates while improving desirable 
physical properties, as determined by results from the aforementioned secondary screening 
protocols. 



Further examples of the application of such techniques can be found in 
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5.8. Use of Identified Compounds That Bind RNA to Treat/Prevent Disease 

Biologically active compounds identified using the methods of the invention 
or a pharmaceutically acceptable salt thereof can be administered to a patient, preferably a 

^ mammal, more preferably a human, suffering from a disease whose progression is 

associated with premature translation termination and/or nonsense-mediated mRNA decay. 
In certain embodiments, such compounds or a pharmaceutically acceptable salt thereof is 
administered to a patient, preferably a mammal, more preferably a human, as a preventative 
measure against a disease associated with premature translation termination and/or 

10 nonsense-mediated mRNA decay. 

In a preferred embodiment, it is first determined that the patient is suffering 
from a disease associated with premature translation termination and/or nonsense-mediated 
mRNA decay. In a preferred embodiment, the DNA of the patient can be sequenced or 
subject to Southern blot, polymerase chain reaction ("PCR"), use of the Short Tandem 

j 5 Repeat ("STR"), or polymorphic length restriction fragments ("RFLP") analysis to 

determine if a nonsense mutation is present in the DNA of the patient. Alternatively, it can 
be determined if altered levels of the protein with the nonsense mutation are expressed in 
the patient by western blot or other immunoassays. Such methods are well known to one of 
skill in the art. 

20 In one embodiment, "treatment 11 or "treating" refers to an amelioration of a 

disease, or at least one discernible symptom thereof. In another embodiment, "treatment" or 
"treating" refers to an amelioration of at least one measurable physical parameter, not 
necessarily discernible by the patient In yet another embodiment, "treatment" or "treating" 
refers to inhibiting the progression of a disease, either physically, e.g., stabilization of a 

25 discernible symptom, physiologically, e.g. , stabilization of a physical parameter, or both. In 
yet another embodiment, "treatment" or "treating" refers to delaying the onset of a disease. 

In certain embodiments, the compound or a pharmaceutically acceptable salt 
thereof is administered to a patient, preferably a mammal, more preferably a human, as a 
preventative measure against a disease associated with a disease caused and/or associated 

30 nonsense suppression. As used herein, "prevention" or "preventing" refers to a 
reduction of the risk of acquiring a disease. In one embodiment, the compound or a 
pharmaceutically acceptable salt thereof is administered as a preventative measure to a 
patient. According to this embodiment, the patient can have a genetic predisposition to a 
disease, such as a family history of the disease, or a non-genetic predisposition to the 

35 disease. Accordingly, the compound and pharmaceutically acceptable salts thereof can be 
used for the treatment of one manifestation of a disease and prevention of another. 
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In a preferred embodiment, the compounds identified vising the methods of 
the present invention are used to treat or prevent a disease caused by one or more nonsense 
mutations. Examples of diseases caused by nonsense mutations include, but are not limited 
to, cystic fibrosis, muscular dystrophy, heart disease, lung cancer, breast cancer, colon 
cancer, pancreatic cancer, non-Hodgkin's lymphoma, ovarian cancer, esophageal cancer, 
colorectal carcinomas, neurofibromatosis, retinoblastoma, Wilm's tumor, retinitis 
pigmentosa, collagen disorders, cirrhosis, Tay-Sachs disease, blood disorders, kidney 
stones, and ataxia-telangiectasia. Genes that contain one or more nonsense mutations that 
are potentially involved in causing disease are presented in table form according to 
chromosome location in Example 9 infra. 

When administered to a patient, the compound or a pharmaceutically 
acceptable salt thereof is preferably administered as component of a composition that 
optionally comprises a pharmaceutically acceptable vehicle. The composition can be 
administered orally, or by any other convenient route, for example, by infusion or bolus 
injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, 
rectal, and intestinal mucosa, etc.) and may be administered together with another 
biologically active agent. Administration can be systemic or local. Various delivery 
systems are known, e.g., encapsulation in liposomes, microparticles, microcapsules, 
capsules, etc., and can be used to administer the compound and pharmaceutically acceptable 
salts thereof. 

Methods of administration include but are not limited to intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, oral, 
sublingual, intranasal, intracerebral, intravaginal, transdermal, rectally, by inhalation, or 
topically, particularly to the ears, nose, eyes, or skin. The mode of administration is left to 
the discretion of the practitioner. In most instances, administration will result in the release 
of the compound or a pharmaceutically acceptable salt thereof into the bloodstream. 

In specific embodiments, it may be desirable to administer the compound or 
a pharmaceutically acceptable salt thereof locally. This may be achieved, for example, and 
not by way of limitation, by local infusion during surgery, topical application, e.g., in 
conjunction with a wound dressing after surgery, by injection, by means of a catheter, by 
means of a suppository, or by means of an implant, said implant being of a porous, non- 
porous, or gelatinous material, including membranes, such as sialastic membranes, or fibers. 

In certain embodiments, it may be desirable to introduce the compound or a 
pharmaceutically acceptable salt thereof into the central nervous system by any suitable 
route, including intraventricular, intrathecal and epidural injection. Intraventricular 
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injection may be facilitated by an intraventricular catheter, for example, attached to a 
reservoir, such as an Ommaya reservoir. 

Pulmonary administration can also be employed, e.g., by use of an inhaler or 
^ nebulizer, and formulation with an aerosolizing agent, or via perfusion in a fluorocarbon or 
synthetic pulmonary surfactant In certain embodiments, the compound and 
pharmaceutically acceptable salts thereof can be formulated as a suppository, with 
traditional binders and vehicles such as triglycerides. 

In another embodiment, the compound and pharmaceutically acceptable salts 
I q thereof can be delivered in a vesicle, in particular a liposome (see Langer, 1 990, Science 
249:1527-1533; Treat et ai y in Liposomes in the Therapy of Infectious Disease and Cancer, 
Lopez-Berestein and Fidler (eds.), Liss, New York, pp. 353-365 (1989); Lopez-Berestein, 
ibid, pp. 317-327; see generally ibid). 

In yet another embodiment, the compound and pharmaceutically acceptable 

j 5 salts thereof can be delivered in a controlled release system (see, e.g. , Goodson, in Medical 
Applications of Controlled Release, supra, vol. 2, pp. 1 15-138 (1984)). Other controlled- 
release systems discussed in the review by Langer, 1990, Science 249: 1527-1533) may be 
used. In one embodiment, a pump may be used (see Langer, supra; Sefton, 1987, CRC Crit. 
Ref. Biomed. Eng. 14:201; Buchwald etal, 1980, Surgery 88:507 Saudek etal, 1989, N. 

2 q Engl. J. Med. 321 :574). In another embodiment, polymeric materials can be used (see 
Medical Applications of Controlled Release, Langer and Wise (eds.), CRC Pres., Boca 
Raton, Florida (1974); Controlled Drug Bioavailability, Drug Product Design and 
Performance, Smolen and Ball (eds.), Wiley, New York (1984); Ranger and Peppas, 1983, 
J. Macromol. Sci. Rev. MacromoL Chem. 23:61 ; see also Levy et al 9 1985, Science 

25 228:190; During et aU 1989, Ann. Neurol. 25:351; Howard et al y 1989, J. Neurosurg. 

71 : 105). In yet another embodiment, a controlled-release system can be placed in proximity 
of a target RNA of the compound or a pharmaceutically acceptable salt thereof, thus 
requiring only a fraction of the systemic dose. 

Compositions comprising the compound or a pharmaceutically acceptable 

3Q salt thereof ("compound compositions") can additionally comprise a suitable amount of a 
pharmaceutically acceptable vehicle so as to provide the form for proper administration to 
the patient. 

In a specific embodiment, the term "pharmaceutically acceptable" means 
approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
35 Pharmacopeia or other generally recognized pharmacopeia for use in animals, mammals, 
and more particularly in humans. The term "vehicle" refers to a diluent, adjuvant, excipient, 
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or carrier with which a compound of the invention is administered. Such pharmaceutical 
vehicles can be liquids, such as water and oils, including those of petroleum, animal, 
vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the 
like. The pharmaceutical vehicles can be saline, gum acacia, gelatin, starch paste, talc, 
keratin, colloidal silica, urea, and the like. In addition, auxiliary, stabilizing, thickening, 
lubricating and coloring agents may be used. When administered to a patient, the 
pharmaceutically acceptable vehicles are preferably sterile. Water is a preferred vehicle 
when the compound of the invention is administered intravenously. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles, 
particularly for injectable solutions. Suitable pharmaceutical vehicles also include 
excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propylene, glycol, water, ethanol and the like. Compound compositions, if 
desired, can also contain minor amounts of wetting or emulsifying agents, or pH buffering 
agents. 

Compound compositions can take the form of solutions, suspensions, 
emulsion, tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained- 
release formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other 
form suitable for use. In one embodiment, the pharmaceutically acceptable vehicle is a 
capsule (see e.g., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical 
vehicles are described in Remington's Pharmaceutical Sciences, Alfonso R. Gennaro, ed., 
Mack Publishing Co. Easton, PA, 19th ed., 1995, pp. 1447 to 1676, incorporated herein by 
reference. 

In a preferred embodiment, the compound or a pharmaceutically acceptable 
salt thereof is formulated in accordance with routine procedures as a pharmaceutical 
composition adapted for oral administration to human beings. Compositions for oral 
delivery may be in the form of tablets, lozenges, aqueous or oily suspensions, granules, 
powders, emulsions, capsules, syrups, or elixirs, for example. Orally administered 
compositions may contain one or more agents, for example, sweetening agents such as 
fructose, aspartame or saccharin; flavoring agents such as peppermint, oil of wintergreen, or 
cherry; coloring agents; and preserving agents, to provide a pharmaceutically palatable 
preparation. Moreover, where in tablet or pill form, the compositions can be coated to delay 
disintegration and absorption in the gastrointestinal tract thereby providing a sustained 
35 action over an extended period of time. Selectively permeable membranes surrounding an 
osmotically active driving compound are also suitable for orally administered compositions. 
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In these later platforms, fluid from the environment surrounding the capsule is imbibed by 
the driving compound, which swells to displace the agent or agent composition through an 
aperture. These delivery platforms can provide an essentially zero order delivery profile as 

5 opposed to the spiked profiles of immediate release formulations. A time delay material 
such as glycerol monostearate or glycerol stearate may also be used. Oral compositions can 
include standard vehicles such as mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, and the like. Such vehicles are preferably of 
pharmaceutical grade. Typically, compositions for intravenous administration comprise 

j q sterile isotonic aqueous buffer. Where necessary, the compositions may also include a 
solubilizing agent. 

In another embodiment, the compound or a pharmaceutically acceptable salt 
thereof can be formulated for intravenous administration. Compositions for intravenous 
administration may optionally include a local anesthetic such as lignocaine to lessen pain at 

^ the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water-free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent Where the compound or a pharmaceutically acceptable salt thereof 
is to be administered by infusion, it can be dispensed, for example, with an infusion bottle 

2q containing sterile pharmaceutical grade water or saline. Where the compound or a 

pharmaceutically acceptable salt thereof is administered by injection, an ampoule of sterile 
water for injection or saline can be provided so that the ingredients may be mixed prior to 
administration. 

The amount of a compound or a pharmaceutically acceptable salt thereof that 
25 will be effective in the treatment of a particular disease will depend on the nature of the 
disease, and can be determined by standard clinical techniques. In addition, in vitro or in 
vivo assays may optionally be employed to help identify optimal dosage ranges. The precise 
dose to be employed will also depend on the route of administration, and the seriousness of 
the disease, and should be decided according to the judgment of the practitioner and each 
patient f s circumstances. However, suitable dosage ranges for oral administration are 
generally about 0.001 milligram to about 200 milligrams of a compound or a 
pharmaceutically acceptable salt thereof per kilogram body weight per day. In specific 
preferred embodiments of the invention, the oral dose is about 0.0 1 milligram to about 100 
milligrams per kilogram body weight per day, more preferably about 0.1 milligram to about 
35 75 milligrams per kilogram body weight per day, more preferably about 0.5 milligram to 5 
milligrams per kilogram body weight per day. The dosage amounts described herein refer 
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to total amounts administered; that is, if more than one compound is administered, or if a 
compound is administered with a therapeutic agent, then the preferred dosages correspond 
to the total amount administered. Oral compositions preferably contain about 10% to about 
95% active ingredient by weight 

Suitable dosage ranges for intravenous (i.v.) administration are about 0.01 
milligram to about 100 milligrams per kilogram body weight per day, about 0.1 milligram to 
about 35 milligrams per kilogram body weight per day, and about 1 milligram to about 10 
milligrams per kilogram body weight per day. Suitable dosage ranges for intranasal 
administration are generally about 0.01 pg/kg body weight per day to about 1 mg/kg body 
weight per day. Suppositories generally contain about 0.01 milligram to about 50 
milligrams of a compound of the invention per kilogram body weight per day and comprise 
active ingredient in the range of about 0.5% to about 10% by weight. 

Recommended dosages for intradermal, intramuscular, intraperitoneal, 
subcutaneous, epidural, sublingual, intracerebral, intravaginal, transdermal administration or 
administration by inhalation are in the range of about 0.001 milligram to about 200 
milligrams per kilogram of body weight per day. Suitable doses for topical administration 
are in the range of about 0.001 milligram to about 1 milligram, depending on the area of 
administration. Effective doses may be extrapolated from dose-response curves derived 
from in vitro or animal model test systems. Such animal models and systems are well 
known in the art. 

The compound and pharmaceutically acceptable salts thereof are preferably 
assayed in vitro and in vivo, for the desired therapeutic or prophylactic activity, prior to use 
in humans. For example, in vitro assays can be used to determine whether it is preferable to 
administer the compound, a pharmaceutically acceptable salt thereof, and/or another 
therapeutic agent. Animal model systems can be used to demonstrate safety and efficacy. 

A variety of compounds can be used for treating or preventing diseases in 
mammals. Types of compounds include, but are not limited to, peptides, peptide analogs 
including peptides comprising non-natural amino acids, e.g., D-amino acids, phosphorous 
analogs of amino acids, such as a-amino phosphonic acids and a-amino phosphinic acids, 
or amino acids having non-peptide linkages, nucleic acids, nucleic acid analogs such as 
phosphorothioates or peptide nucleic acids ("PNAs"), hormones, antigens, synthetic or 
naturally occurring drugs, opiates, dopamine, serotonin, catecholamines, thrombin, 
acetylcholine, prostaglandins, organic molecules, pheromones, adenosine, sucrose, glucose, 
lactose and galactose. 
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6. EXAMPLE: IDENTIFICATION OF A DYE-LABELED TARGET RNA 

BOUND TO SM ALL MOL ECULAR WEIGHT COMPOUNDS 

The results presented in this Example indicate that gel mobility shift assays 
can be used to detect the binding of small molecules, such as the Tat peptide and 
gentamicin, to their respective target RNAs. 



6.1. Materials and Methods 



10 6.1.1. Buffers 

Tris-potassium chloride (TK) buffer is composed of 50 mM Tris-HCl pH 
7.4, 20mM KC1, 0.1%Triton X-100, and 0.5mM MgCl 2 . Tris-borate-EDTA (TBE) buffer is 
composed of 45 mM Tris-borate pH 8.0, and 1 mM EDTA. Tris-Potassium chloride- 
magnesium (TKM) buffer is composed of 50 mM Tris-HCl pH 7.4, 20mM KC1, 0.1%Triton 
15 X-100 and 5mMMgCl 2 . 



6.1.2. Gel retardation analysis 
RNA oligonucleotides were purchased from Dharmacon, Inc, Lafayette, 
CO). 500 pmole of either a 5' fluorescein labeled oligonucleotide corresponding to the 16S 

20 rRNA A site (5'-GGCGUCACACCUUCGGGUGAAGUCGCC-3' (SEQ ID NO: 1); 
Moazed & Noller, 1987, Nature 327:389-394; Woodcock et al, 1991, EMBO J. 10:3099- 
3103; Yoshizawa et al 9 1998, EMBO J. 17:6437-6448) or a 5' fluorescein labeled 
oligonucleotide corresponding to the HTV-1 TAR element TAR RNA (5'- 
GGCAGAUCUGAGCCUGGGAGCUCUCUGCC-3 > (SEQ ID NO: 2); Huq et al, 1999, 

25 Nucleic Acids Research. 27:1084-1093; Hwang etal, 1999, Proc. Natl. Acad. Sci. USA 
96:12997-13002) was 3' labeled with 5'- 32 P cytidine 3\ 5*-bis(phosphate) (NEN) and T4 
RNA ligase (NEBiolabs) in 10% DMSO as per manufacturer's instructions. The labeled 
oligonucleotides were purified using G-25 Sephadex columns (Boehringer Mannheim). For 
Tat-TAR gel retardation reactions the method of Huq et al (Nucleic Acids Research, 1999, 

30 27:1084-1093) was utilized with TK buffer containing 0.5mM MgCl 2 and a 12-mer Tat 
peptide (YGRKKRRQRRRP (SEQ ID NO: 3; single letter amino acid code). For 16S 
rRNA-gentamicin reactions, the method of Huq et al was used with TKM buffer. In 20 \xl 
reaction volumes 50 pmoles of 32 P cytidine-labeled oligonucleotide and either gentamicin 
sulfate (Sigma) or the short Tat peptide (Tat 47 . 58 ) in TK or TKM buffer were heated at 90°C 

35 for 2 minutes and allow to cool to room temperature (approximately 24 °C) over 2 hours. 
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Then 10 nl of 30% glycerol was added to each reaction tube and the entire sample was 
loaded onto a TBE non-denaturing polyacrylamide gel and electrophoresed at 1200-1600 
volt-hours at 4°C. The gel was exposed to an intensifying screen and radioactivity was 
quantitated using a Typhoon phosporimager (Molecular Dynamics). 

6.2. Background 

t One method used to demonstrate small molecule interactions with natural 

occurring RNA structures such as ribosomes is by a method called chemical footprinting or 
toe printing (Moazed & Noller, 1987, Nature 327:389-394; Woodcock et al, 1991, EMBO 
J. 10:3099-3103; Yoshizawa etal, 1998, EMBO J. 17:6437-6448). Here the use of gel 
mobility shift assays to monitor RNA-small molecule interactions are described. This 
approach allows for rapid visualization of small molecule-RNA interactions based on the 
difference between mobility of RNA alone versus RNA in a complex with a small 
molecule. To validate this approach, an RNA oligonucleotide corresponding to the well- 
characterized gentamicin binding site on the 16S rRNA (Moazed & Noller, 1987, Nature 
327:389-394) and the equally well-characterized HIV-l TAT protein binding site on the 
HTV-1 TAR element (Huq et al., 1999, Nucleic Acids Res. 27: 1084-1093) were chosen. 
The purpose of these experiments is to lay the groundwork for the use of chromatographic 
techniques in a high throughput fashion, such as microcapillary electrophoresis, for drug 
discovery. 



6.3. Results 

A gel retardation assay was performed using the Tat 47 . 58 peptide and the TAR 
RNA oligonucleotide. As shown in Figure 2, in the presence of the Tat peptide, a clear shift 
is visible when the products are separated on a 12% non-denaturing polyacrylamide gel. In 
the reaction that lacks peptide, only the free RNA is visible. These observations confirm 
previous reports made using other Tat peptides (Hamy et al. 9 1997, Proc. Natl. Acad. ScL 
USA 94:3548-3553; Huq et al y 1999, Nucleic Acids Res. 27: 1084-1093). 

Based on the results of Figure 2, it was hypothesized that RNA interactions 
with small organic molecules could also be visualized using this method. As shown in 
Figure 3, the addition of varying concentrations of gentamicin to an RNA oligonucleotide 
corresponding to the 16S rRNA A site produces a mobility shift. These results demonstrate 
that the binding of the small molecule gentamicin to an RNA oligonucleotide having a 
defined structure in solution can be monitored using this approach. In addition, as shown in 
Figure 3, a concentration as low as lOng/ml gentamicin produces the mobility shift. 
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To detemiine whether lower concentrations of gentamicin would be 
sufficient to produce a gel shift, similar experiment was performed, as shown in Figure 3, 
except that the concentrations of gentamicin ranged from 100 ng/ml to 10 pg/ml. As shown 
in Figure 4, gel mobility shifts are produced when the gentamicin concentration is as low as 
10 pg/ml. Further, the results shown in Figure 4 demonstrate that the shift is specific to the 
16S rRNA oligonucleotide as the use of an unrelated oligonucleotide, corresponding to the 
HIV TAR RNA element, does not result in a gel mobility shift when incubated with 10 
jig/ml gentamicin. In addition, if a concentration as low as 10 pg/ml gentamicin produces a 
gel mobility shift then it should be possible to detect changes to RNA structural motifs 
when small amounts of compound from a library of diverse compounds is screened in this 
fashion. 

Further analysis of the gentamicin-RNA interaction indicates that the 
interaction is Mg- and temperature dependent As shown in Figure 5, when MgCl 2 is not 
^ present (TK buffer), lmg/ml of gentamicin must be added to the reaction to produce a gel 
shift. 

Similarly, the temperature of the reaction when gentamicin is added is also 
important. When gentamicin is present in the reaction during the entire 
denaturation/renaturation cycle, that is, when gentamicin is added at 90C°C or 85°C, a gel 
shift is visualized (data not shown). In contrast, when gentamicin is added after the 
renaturation step has proceeded to 75 °C, a mobility shift is not produced. These results are 
consistent with the notion that gentamicin may recognize and interact with an RNA 
structure formed early in the renaturation process. 



20 



25 



7. EXAMPLE: IDENTIFICATION OF A DYE-LABELED TARGET RNA 

BOUND TO SMALL MOLECULAR WEIGHT COMPOUNDS 
BY CAPILLARY ELECTROPHORESIS 



The results presented in this Example indicate that interactions between a 
peptide and its target RNA, such as the Tat peptide and TAR RNA, can be monitored by gel 
2Q retardation assays in an automated capillary electrophoresis system. 

7.1. Materials and Methods 



7.1.1. Buffers 

Tris-potassium chloride (TK) buffer is composed of 50 mM Tris-HCl pH 
7.4, 20mM KC1, 0.1%Triton X-100, and 0.5mM MgCl 2 . Tris-borate-EDTA (TBE) buffer is 
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composed of 45 raM Tris-borate pH 8.0 S and 1 mM EDTA. Tris-Potassium chloride- 
magnesium (TKM) buffer is composed of 50 mM Tris-HCl pH 7.4, 20mM KC1, 0. l%Triton 
X-100and5mM MgCl 2 . 



CO)- 500 pmole of a 5' fluorescein labeled oligonucleotide corresponding to the HIV-1 
TAR element TAR RNA (5 , .GGCAGAUCUGAGCCUGGGAGCUCUCUGCC-3 , (SEQ 
ID NO: 2);Huqef aL, 1999, Nucleic Acids Research. 27:1084-1093; Hwang et aL, 1999, 
Proc. Natl. Acad. Sci. USA 96:12997-13002) was used. For Tat-TAR gel retardation 
reactions the method of Unqet aL (Nucleic Acids Research, 1999, 27:1084-1093) was 
utilized with TK buffer containing 0.5mM MgCl 2 and a 12-mer Tat peptide 
(YGRKKRRQRRRP (SEQ ID NO: 3); single letter amino acid code). In 20 jai reaction 
volumes 50 pmoles of labeled oligonucleotide and the short Tat peptide (Tat 47 . 58 ) in TK or 
TKM buffer were heated at 90°C for 2 minutes and allow to cool to room temperature 
(approximately 24 °C) over 2 hours. The reactions were loaded onto a SCE9610 automated 
capillary electrophoresis apparatus (SpectruMedix; State College, Pennsylvania). 



peptide and RNA can be monitored by gel retardation assays. It was hypothesized that 
interactions between a peptide and RNA could be monitored by gel retardation assays by an 
automated capillary electrophoresis system. To test this hypothesis, a gel retardation assay 
by an automated capillary electrophoresis system was performed using the Tat 47 . 58 peptide 
and the TAR RNA oligonucleotide. As shown in Figure 6 using the capillary 
electrophoresis system, in the presence of the Tat peptide, a clear shift is visible upon the 
addition of increasing concentrations of Tat peptide. In the reaction that lacks peptide, only 
a peak corresponding to the free RNA is observed. These observations confirm previous 
reports made using other Tat peptides (Hamy et aL, 1997, Proc. Natl. Acad. Sci. USA 
94:3548-3553; Huq etal, 1999, Nucleic Acids Res. 27: 1084-1093). 



5 



7.1.2. Gel Retardation Analysis Using C apillary E lectrophoresis 



RNA oligonucleotides were purchased from Dharmacon, Inc, Lafayette, 



20 



7.2. Results 

As presented in the previous Example in Section 6, interactions between a 



35 
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8. EXAMPLE: COMPOUNDS THAT MODULATE TRANSLATION 

TERMINATION BIND SPECIFIC REGIONS OF 28S rRNA 

Data is presented in this Example that demonstrates that specific regions of 
5 the 28S rRNA are involved in modulating translation termination in mammalian cells. 

Compounds that interact in these regions or modulate local changes within these regions of 
the ribosome (e.g., alter base pairing interactions, base modification or modulate binding of 
trans-acting factors that bind to these regions) have the potential to modulate translation 
termination. These regions are conserved from prokaryotes to eukaryotes, but the role of 
10 these regions in modulating translation termination has not been realized in eukaryotes. In 
bacteria, when a short RNA fragment, complementary to the E. coli 23 S rRNA segment 
comprising nucleotides 735 to 766 (in domain II), is expressed in vivo, suppression of UGA 
nonsense mutations, but not UAA of UAG, results (Chemyaeva et al, 1999, J Bacteriol 
1 81 :5257-5262). Other regions of the 23 S rRNA in E. coli have been implicated in 
1 5 nonsense suppression including the GTPase center in domain II (nt 1 034-1 120; Jemiolo et 
al, 1995, Proc. Nat. Acad. Sciv 92:12309-12313). 

8.1. Materials and Methods 

20 8.1-1. Small Molecules Involved in Modulating Translation Termination 

Small molecules involved injnodulating translation termination, Le. 9 
nonsense suppression, were used in the footprinting experiments presented in Figures 2 to 6 
and are listed as Compound A (molecular formula C l9 H 21 N0 4 ), Compound B (molecular 
formula C I9 H 21 N 2 0 5 ), Compound C (molecular formula C 12 H l5 N 5 0), Compound D 
25 (molecular formula C^H^C^Br), Compound E (molecular formula C l9 H 2I N0 4 ), Compound 
F, Compound G (molecular formula C l2 H, 5 N 5 0), Compound H (molecular formula 
C 2 3H I5 N0 5 ) > Compound I (molecular formula C 23 H t5 N0 5 ), Compound J, and Compound K. 



8.1.2. Preparation of a Translation Extract from HcLa cells 

30 HeLa S3 cells were grown to a density of 10 6 cells/ml in DMEM; 5%C0 2 , 

10%FBS, IX P/S in a spinner flask. Cells were harvested by spinning at lOOOXg. Cells 
were washed twice with phosphate buffered saline. The cell pellet was on ice for 12-24 
hours before proceeding. By letting the cells sit on ice, the activity of the extract is 
increased up to twenty-fold. The length of time on ice can range from 0 hours to 1 week. 

35 The cells were resuspended in 1 .5 volumes (packed cell volume) of hypotonic buffer (10 
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mM HEPES (KOH) pH 7.4; 15 mM KC1; 1.5 mM Mg(OAc)2; 0.5 mM Pefabloc (Roche); 2 
mM DTT). The cells were allowed to swell for 5 minutes on ice, dounce homogenized with 
10 to 100 strokes using a tight-fitting pestle, and spun for 10 minutes at 12000Xg at 4°C in 
a Sorvall SS-34 rotor. The supernatant was collected with a Pasteur pipet without 
disturbing the lipid layer, transferred into Eppendorf tubes (50 to 200 nl aiiquots), and 
immediately frozen in liquid nitrogen. 



8.13. Footprinting 

Ribosomes prepared from HeLa cells were incubated with the small 
molecules (at a concentration of 100 jiM), followed by treatment with chemical modifying 
agents (dimethyl sulfate [DMS] and kethoxal [KE]). Following chemical modification, 
rRNA was phenol-chloroform extracted, ethanol precipitated, analyzed in primer extension 
reactions using end-labeled oligonucleotides hybridizing to different regions of the rRNAs 
and resolved on 6% polyacrylamide gels. The probes used for primer extension cover the 
entire 18S (7 oligonucleotide primers), 28S (24 oligonucleotide primers), and 5S (one 
primer) rRNAs are presented in Table 1 (also see, e.g., Gonzalez et aL, 1985 Proc Natl Acad 
Sci USA. 82(22):7666-70 and McCallum & Maden, 1985, Biochem. J. 232 (3): 725-733). 
Controls in these experiments include DMSO (a control for changes in rRNA accessibility 
induced by DMSO), paromomycin (a marker for 18S rRNA binding), and anisomycin (a 
marker for 28S rRNA binding). 



Table 1: 


18S, 28S, and 5S rRNA primers 




5S#1 


AAAGCCTACAGCACCC 


SEQIDNO.:4 


28S#1 


TACTGAGGGAATCCTGG 


SEQIDNO.:5 


28S#2 


TTACCACCCGCTTTGGG 


SEQ ID NO.: 6 


28S#3 


GGGGGCGGGAAAGATCC 


SEQIDNO.:7 


28S#4 


CCCCGAGCCACCTTCCC 


SEQ ID NO.: 8 


28S#5 


GGCCCCGGGATTCGGCG 


SEQ ID NO.: 9 


28S#6 


CACTGGGGACAGTCCGC 


SEQ ID NO: 10 


28S#7 


CGCGGCGGGCGAGACGGG 


SEQ ID NO.: 11 


28S#8 


GAGGGAAACTTCGGAGGG ' 


SEQ ID NO: 12 


28S#9 


CATCGGGCGCCTTAACCC 


SEQ ID NO.: 13 
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28S#10 CGACGCACACCACACGC 

28S#11 C C AAG ATCTGC AC CTGC 

28S#12 TTACCGCACTGGACGCC 

28S#13 GCCAGAGGCTGTTCACC 

28S#14 TGGGGAGGGAGCGAGCGGCG 

28S#15 AAGGGCCCGGCTCGCGTCC 

28S#16 AGGGCGGGGGGACGAACCGC 

28S#17 TTAAACAGTCGGATTCCCCTGG 

28S#18 TTCATCCATTCATGCGCG 

28S#19 AGTAGTGGTATTTCACCGG 

28S#20 ACGGGAGGTTTCTGTCC 

15 28S#21 ACAATGATAGGAAGAGCCG 

28S#22 AGGCGTTCAGTCATAATCCC 

28S#23 TCCGCACCGGACCCCGGTCC 

28S#24 GGGCTAGTTGATTCGGCAGGTGAGTTG 

20 18S#1 TCTCCGGAATCGAACCCT 

1 8S#2 ATT ACC GCGGCTGCTGGC 

18S#3 TTGGCAAATGCTTTCGC 

18S#4 CCGTCAATTCCTTTAAGTTTC 

25 18S#5 AGGGCATCACAGACCTGTTAT 

18S#6 CGACGGGCGGTGTGTAC 

18S#7 CCGCAGGTTCACCTACGG 



SEQ ED NO.: 14 
SEQIDNO.: 15 
SEQIDNO.: 16 
SEQIDNO.: 17 
SEQIDNO.: 18 
SEQIDNO.: 19 
SEQIDNO.: 20 
SEQIDNO.: 21 
SEQ ID NO.: 22 
SEQIDNO.: 23 
SEQIDNO.: 24 
SEQ ID NO.: 25 
SEQIDNO.: 26 
SEQ ID NO.: 27 
SEQ ID NO.: 28 
SEQ ID NO: 29 
SEQIDNO.: 30 
SEQ ED NO.: 31 
SEQIDNO.: 32 
SEQ ID NO: 33 
SEQ ED NO.: 34 
SEQIDNO.: 35 



30 



8.2. Results 

The results of these foot-printing experiments (see, e.g., Figures 7 to 1 1) 
indicated that the small molecules involved in modulating translation termination alter the 
accessibility of the chemical modifying agents to specific nucleotides in the 28S rRNA. 
More specifically, the regions protected by the small molecules include a conserved region 
in the vicinity of the peptidyl transferase center (domain V, see, e.g., Figures 7 and 8) 
35 implicated in peptide bond fonnation and a conserved region in domain II (see, e.g., Figures 
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9, 10, and 1 1) that may interact with the peptidyl transferase center based on binding of 
vernamycin B to both these areas (Vannuffel et al., 1994, Nucleic Acids Res. 
22(21):4449-53). 

5 

9. EXAMPLE: HUMAN DISEASE GENES SORTED BY CHROMOSOMF. 



Table 2: Genes, Locations and Genetic Disorders on Chromosome 1 



10 


Gene 


GDB Accession ID 


OMIM Link 


ABCA4 


GDB:370748 


MACULAR DEGENERATION, SENILE 
STARGARDT DISEASE 1; STGD1 ATP 

TJTKTrjTXJI^ C* A TT> A XTG"D/ r YE>T r CI> . 
r>liNUllNVj LAool 1 1 JC/ 1 IvAJN orUK 1 Jbxv; 

ABCR RETINITIS PIGMENTOSA- 1 9; 
RP19 


1 c 
ID 


ABCD3 


GDB:131485 


PEROXISOMAL MEMBRANE PROTEIN 
l; PXMPl 






(jDd:1 18958 


ACYL-CoA DEHYDROGENASE, 
MEDIUM-CHAIN; ACADM 




AGL 


GDB: 132644 


GLYCOGEN STORAGE DISEASE HI 


20 


AGT 


GDB: 118750 


ANGIOTENSIN I; AGT 


ALDH4A1 


GDB:9958827 


HYPERPROLINEMIA, TYPE II 




ALPL 


GDB: 118730 


PHOSPHATASE, LIVER ALKALINE; 
ALPL HYPOPHOSPHATASIA, 
INFANTILE 


25 


AMPD1 


GDB: 119677 


ADENOSINE MONOPHOSPHATE 
DEAMDMASE-1; AMPD1 




APOA2 


GDB.l 19685 


APOLIPOPROTEIN A-H; APOA2 




AVSD1 


GDB:265302 


ATRIOVENTRICULAR SEPTAL 
DEFECT; AVSD 


30 


BRCD2 


GDB:9955322 


BREAST CANCER, DUCTAL, 2; BRCD2 


C1QA 


GDB: 119042 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, ALPHA 
POLYPEPTIDE; C1QA 




C1QB 


GDB: 119043 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, BETA 
POLYPEPTIDE; C1QB [ 
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Gene 


GDB Accession ID 


OMIM Link 


C1QG 


GDB:128132 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, GAMMA 
POLYPEPTIDE; CI QG 


C8A 


GDB:1 19735 


COMPLEMENT COMPONENT-8, 
DEFICIENCY OF 


C8B 


GDB: 119736 


COMPLEMENT COMPONENT-8, 
DEFICIENCY OF, TYPE H 


CACNA1S 


GDB: 126431 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, L TYPE, 
ALPHA IS SUBUNIT; CACNA1S 
PERIODIC PARALYSIS I MALIGNANT 
HYPERTHERMIA SUSCEPTIBILITY-5; 


CCV 


GDB: 1336655 


CATARACT, CONGENITAL, 

VOT T^MATMTsI TYPP- PPV 
v kj jl» j\j v i>j in 11 Jr J_/ 5 v 


CD3Z 


GDB:1 19766 


CD3Z ANTIGEN, ZETA POLYPEPTIDE; 
CD3Z 


CDC2L1 


GDB: 127827 


PROTEIN KINASE p58; PK58 


CHML 


GDB: 135222 


CHOROIDEREMIA-LIKE; CHML 


CHS1 


GDB:4568202 


CHEDIAK-HIGASHI SYNDROME; 
CHS1 


CIAS1 


GDB.-9957338 


COLD HYPERSENSITIVITY 
URTICARIA, DEAFNESS, AND 
AMYLOIDOSIS 




CLCNKB 


GDB:698472 


CHLORIDE CHANNEL, KIDNEY, B; 
CLCNKB 


CMD1A 


GDB:434478 


CARDIOMYOPATHY, DILATED 1 A; 
CMD1A 


CMH2 


GDB: 137324 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 2; CMH2 


CMM 


GDB:1 19059 


MELANOMA, MALIGNANT 


COL11A1 


GDB: 120595 


COLLAGEN, TYPE XI, ALPHA- 1 ; 
COL1IA1 
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Gene 


GDB Accession ID 


OMIM Link 


COL9A2 


GDB:138310 


COLLAGEN, TYPE DC, ALPHA-2 
CHAIN; COL9A2 EPIPHYSEAL 

rwCHT ACTA A Al TT ' 1 ' 1 L)T TJ O. CTWiTO 

DYbrLAolA, MUL 1 IrLc, L\ riL)M2 


CPT2 


uDB: 127272 


iv /rvrxo a txiv vx/tttj rMunYTrnvT/-' v rvc 
MYOPAlrii Wllti L"t.rlULbJNU I Ur 

CARNITINE 

PALMITOYLTRANSFERASE II 
HYPOGLYCEMIA, HYPOKETOTIC, 
WITH DEFICIENCY OF CARNITINE 
PALMITOYLTRANSFERASE 
CARNITINE 

PALMITOYLTRANSFERASE II; CPT2 


CRB1 


GDB:333930 


RETINITIS PIGMENTOSA- 12; RP12 




CSE 


GDB:596182 


CHOREOATHETOSIS/SPASTICITY, 
EPISODIC; CSE 


CSF3R 


GDB: 126430 


COLONY STIMULATING FACTOR 3 
RECEPTOR, GRANULOCYTE; CSF3R 


CTPA 


GDB:9863168 


CATARACT, POSTERIOR POLAR 


CTSK 


GDB:453910 


PYCNODYSOSTOSIS CATHEPSIN K; 
CTSK 


DBT 


GDB: 11 8784 


MAPLE SYRUP URINE DISEASE, TYPE 
2 


DIOl 


GDB:136449 


THYROXINE DEIODINASE TYPE I; 
TXDI1 


DISCI 


GDB:9992707 


DISORDER-2; SCZD2 




DPYD 


GDB:364102 


DIHYDROPYRIMIDINE 
DEHYDROGENASE; DPYD 


EKV 


GDB:1 19106 


ERYTHROKERATODERMIA 
VARIABILIS; EKV 


ENOl 


GDB: 119871 


PHOSPHOPYRUVATE HYDRATASE; 
PPH 


ENOIP 


GDB:.l 35006 


PHOSPHOPYRUVATE HYDRATASE; 
PPH 



35 
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Gene 


GDB Accession ID 


OMIM Link 


5 


EPB41 


GDB.l 19865 


ERYTHROCYTE MEMBRANE 
PROTEIN BAND 4.1; EPB4 1 
HEREDITARY HEMOT YTTP 




EPHX1 


GDB: 119876 


EPOXIDE HYDROLASE 1, i 
MICROSOMAL- EPHX1 




F13B 


GDB: 119893 


FACTOR XIII, B SUBUNIT; F13B 




F5 


GDB:1 19896 


FACTOR V DEFICIENCY 


10 


FCGR2A 


GDB: 119903 


Fc FRAGMENT OF IgG, LOW AFFINITY 
Ha, RECEPTOR FOR; FCGR2 A 




FCGR2B 


GDB:128183 


Fc FRAGMENT OF IgG, LOW AFFINITY 
Ha, RECEPTOR FOR; FCGR2A 


15 


FCGR3A 


GDB:1 19904 


Fc FRAGMENT OF IgG, LOW AFFINITY 
Ilia, RECEPTOR FOR; FCGR3A 




FCHL 


GDB:9837503 


HYPERLIPIDEMIA, COMBINED 






20 


FH 


GDB: 1191 33 


FUMARATE HYDRATASE; FH 
LEIOMYOMATA, HEREDITARY 
MULTIPLE, OF SKIN 


FM03 


GDB: 135 136 


FLAVIN-CONTAINING 
MONOOXYGENASE 3; FM03 
TRIMETHYLAMINURIA 




FM04 


GDB: 127981 


FLAVIN-CONTAINING 
MONOOXYGENASE 2; FM02 


25 


FUCA1 


GDB: 119237 


FUCOSIDOSIS 




FY 


GDB: 119242 


BLOOD GROUP-DUFFY SYSTEM; Fy 




GALE 


GDB: 119245 


GALACTOSE EPIMERASE 
DEFICIENCY 


30 


GBA 


GDB: 119262 


GAUCHER DISEASE, TYPE I; GD I 






GFND 


GDB:9958222 


GLOMERULAR NEPHRITIS, 
FAMILIAL, WITH FIBRONECTIN 
DEPOSITS 



35 



-61- 



NY2- 1321355.1 



603? 





Gene 


GDB Accession ID 
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5 


GJA8 


GDB:696369 


CATARACT, ZONULAR 
PULVERULENT 1; CZP1 GAP 
JUNCTION PROTEIN, ALPHA-8, 50-KD; 
GJA8 




GJB3 


GDB: 127820 


ERYTHROKERATODERMIA 
VARIABILIS; EKV DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
2; DFNA2 


10 






GLC3B 


GDB:3801939 


GLAUCOMA 3, PRIMARY INFANTILE, 
B- GLC3B 




HF1 


GDB:120041 


H FACTOR 1;HF1 


15 


HMGCL 


GDB:138445 


HYDROXYMETHYLGLUTARICACIDU 
RIA; HMGCL 




HPC1 


GDB:5215209 


PROSTATE CANCER; PRCA1 
PROSTATE CANCER, HEREDITARY 1 


20 


HRD 


GDB:9862254 


HYPOPARATHYROIDISM WITH 
SHORT STATURE, MENTAL 
RETARDATION, AND SEIZURES 


HRPT2 


GDB:125253 


HYPERPARATHYROIDISM, FAMILIAL 
PRIMARY, WITH MULTIPLE 
OSSIFYING JAW 




HSD3B2 


GDB: 134044 


ADRENAL HYPERPLASIA II 


25 


HSPG2 


GDB: 126372 


HEPARAN SULFATE 
PROTEOGLYCAN OF BASEMENT 
MEMBRANE; HSPG2 MYOTONIC 
MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND 
OCULAR AND FACIAL 


30 


KCNQ4 


GDB:439046 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
2; DFNA2 








KCS 


GDB:9848740 


KENNY-CAFFEY SYNDROME, 
RECESSIVE FORM 


35 


KIF1B 


GDB: 128645 


CHARCOT-MARIE-TOOTH DISEASE, 
NEURONAL TYPE, A; CMT2A 
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LAMB3 


GDB:251820 


LAMININ, BETA 3; LAMB3 


5 


LAMC2 


GDB:136225 


LAMINTN, GAMMA 2; LAMC2 
EPIDERMOLYSIS BULLOSA LETALIS 




LGMD1B 


GDB:231606 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE IB; LGMD1B 


10 


LMNA 


GDB.T32146 


LAMIN A/C; LMNA LIPODYSTROPHY, 
FAMILIAL PARTIAL, DUNNIG AN 
TYPE- LDP1 




LOR 


GDB: 132049 


LORICR1N; LOR 




MCKD1 


GDB:9859381 


POLYCYSTIC KIDNEYS, MEDULLARY 
TYPE 


15 


MCL1 


GDB: 139137 


MYELOID CELL LEUKEMIA 1; MCL1 




MPZ 


GDB: 125266 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS MYELIN PROTEIN 
ZERO; MPZ 


20 


MTHFR 


GDB:370882 


5 , 1 0-@METHYLENETETRAHYDROFO 
LATE REDUCTASE; MTHFR 


MTR 


GDB: 11 9440 


METHYLTETRAHYDROFOLATE-.L-HO 
MOCYSTEEME 

S-METHYLTRANSFERASE; MTR 




MUTYH 


GDB:9315115 


ADENOMATOUS POLYPOSIS OF THE 
COLON; APC 


25 






MYOC 


GDB:5584221 


GLAUCOMA 1, OPEN ANGLE; GLC1A 
MYOCIUN; MYOC 




NB 


GDB:9958705 


NEUROBLASTOMA; NB 


30 


NCF2 


GDB:120223 


GRANULOMATOUS DISEASE, 
CHRONIC, AUTOSOMAL 
CYTOCHROME-b-POSITIVE FORM 




NEM1 


GDB: 127387 


NEMALINE MYOPATHY 1, 
AUTOSOMAL DOMINANT; NEM1 


35 


NPHS2 


GDB:9955617 


ARRHYTHMOGENIC RIGHT 
VENTRICULAR DYSPLASIA, 
FAMILIAL, 2; ARVD2 
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NPPA 


GDB:118727 


NATRIURETIC PEPTIDE PRECURSOR 
A; NPPA 


NRAS 


GDB: 11 9457 


ONCOGENE NRAS; NRAS; NRAS1 


NTRK1 


GDB:127897 


ONCOGENE TRK NEUROTROPHIC 
TYROSINE KINASE, RECEPTOR, TYPE 
1; NTRK1 NEUROPATHY, 
CONGENITAL SENSORY, WITH 
ANHIDROSIS 


OPTA2 


GDB:9955577 


OSTEOPETROSIS, AUTOSOMAL 
DOMINANT, TYPE II; OPA2 


PBX1 


GDB: 125351 


PRE-B-CELL LEUKEMIA 
TRANSCRIPTION FACTOR-1; PBX1 


PCHC 


GDB:9955586 


PHEOCHROMOCYTOMA 


PGD 


GDB: 119486 


6-@PHOSPHOGLUCONATE 
DEHYDROGENASE, ERYTHROCYTE 


PHA2A 


GDB:9955628 


PSEUDOHYPOALDOSTERONISM, 
TYPE E; PHA2 


PHGDH 


GDB:9958261 


3-@PHOSPHOGLYCERATE 
DEHYDROGENASE DEFICIENCY 


PKLR 


GDB: 120294 


PYRUVATE KINASE DEFICIENCY OF 
ERYTHROCYTE 


PKP1 


GDB:4249598 


PLAKOPHIL1N 1; PKP1 


PLA2G2A 


GDB: 120296 


PHOSPHOLIPASE A2, GROUP HA; 
PLA2G2A 


PLOD 


GDB: 127821 


PROCOLLAGEN-LYSINE, 
2-OXOGLUTARATE 5-DIOXYGENASE; 
PLOD EHLERS-DANLOS SYNDROME, 
TYPE VI; E-D VI; EDS VI 


PPOX 


GDB: 118852 


PROTOPORPHYRINOGEN OXIDASE; 
PPOX 


PPT 


GDB: 125227 


CEROID-LIPOFUSCn^OSIS, 
NEURONAL 1 , E*F ANTILE; CLN 1 | 
PALMITOYL-PROTEIN 
THIOESTERASE; PPT 



10 



15 



20 



25 



30 



35 
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PRCC 


GDB.3888215 


PAPILLARY RENAL CELL 
CARCINOMA; PRCC 


5 


PRG4 


GDB:9955719 


ARTHROPATHY-CAMPTODACTYLY 
SYNDROME 








PSEN2 


GDB-.633044 


ALZHEIMER DISEASE, FAMILIAL, 
TYPE 4- AD4 


10 


PTOS1 


GDB:6279920 


PTOSIS HERFDITAR Y C OTMCrKNTT A T 
l;PTOSl 




REN 


GDB: 120345 


RENIN; REN 




RFX5 


GDB.-6288464 


REGULATORY FACTOR 5; RFX5 


15 


RHD 


GDB: 119551 


RHESUS BLOOD GROUP, D ANTIGEN; 
RHD 




RMD1 


GDB:448902 


RIPPLING MUSCLE DISEASE-1 ; RMD1 


20 


RPE65 


GDB:226519 


RETINAL PIGMENT 
EPITHELIUM-SPECIFIC PROTEIN, 
65-KD; RPE65 AMAUROSIS 
CONGENITA OF LEBER II 


SCCD 


GDB:9955558 


CORNEAL DYSTROPHY, 
CRYSTALLINE, OF SCHNYDER 




SERPINC1 


GDB: 119024 


ANTITHROMB IN III DEFICIENCY 


25 


SJS1 


GDB:1381631 


MYOTONIC MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND 
OCULAR AND FACIAL 




SLC19A2 ! 


GDB:9837779 


THIAMINE-RESPONSrVE 
MEGALOBLASTIC ANEMIA 
SYNDROME 


30 


SLC2A1 


GDB:120627 


SOLUTE CARRIER FAMILY 2, 
MEMBER 1;SLC2A1 


SPTA1 


GDB:1 19601 


ELLIPTOCYTOSIS, 
RHESUS-UNLINKED TYPE 
HEREDITARY HEMOLYTIC ! 
SPECTRIN, ALPHA, ERYTHROCYTIC 
1; SPTA1 


35 


TALI 


GDB: 120759 


T-CELL ACUTE LYMPHOCYTIC 
LEUKEMIA 1; TALI 
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TNFSF6 


GDB.-422178 


APOPTOSIS ANTIGEN LIGAND 1; 
APT1LG1 


TNNT2 


GDB:221879 


TROPONIN-T2, CARDIAC; TNNT2 


TPM3 


GDB:127872 


ONCOGENE TRK TROPOMYOSIN 3; 
TPM3 


TSHB 


GDB* 120467 


BETA CHAIN; TSHB 


UMPK 


GDB: 120481 


URIDINE MONOPHOSPHATE KINASE; 
UMPK 


UOX 


GDB: 127539 


URATE OXIDASE; UOX 


UROD 


GDB: 119628 


PORPHYRIA CUTANEA TARDA* PCT 


USH2A 


GDB: 120483 


USHER SYNDROME, TYPE II; USH2 


VMGLOM 


GDB:9958134 


GLOMUS TUMORS, MULTIPLE 


VWS 


GDB: 120532 


CLEFT LIP AND/OR PALATE WITH 
MUCOUS CYSTS OF LOWER LIP 


WS2B 


GDB:407579 


WAARDENBURG SYNDROME, TYPE 
2B; WS2B 



20 

Table 3: Genes, Locations and Genetic Disorders on Chromosome 2 



Gene 


GDB Accession ED 


Location 


OMIM Link 


ABCB11 


GDB:9864786 


2q24-2q24 
2q24.3-2q24.3 


CHOLESTASIS, PROGRESSIVE 
FAMILIAL INTRAHEPATIC 2; 
PFIC2 


ABCG5 


GDB: 10450298 


2p21-2p21 


PHYTOSTEROLEMIA , 


ABCG8 


GDB: 10450300 


2p21-2p21 


PHYTOSTEROLEMIA 


ACADL 


GDB: 118745 


2q34-2q35 


ACYL-CoA 
DEHYDROGENASE, 
LONG-CHAIN, DEFICIENCY OF 


ACPI 


GDB: 118962 


2p25-2p25 


PHOSPHATASE, ACID, OF 
ERYTHROCYTE; ACPI 


AGXT 


GDB:127113 


2q37.3-2q37.3 


OXALOSIS I 


AHHR 


GDB:1 18984 


2pter-2q31 j 


CYTOCHROME P450, 
SUBFAMILY L POLYPEPTIDE 
1; CYP1A1 
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5 


ALMS1 


GDB:9865539 


2pl3-2pl2 
2pl4-2pl3 

XT i 

2pl3.1-2pl3.1 


ALSTROM SYNDROME 




ALPP 


GDB: 119672 


2q37.1-2q37.1 


ALKALINE PHOSPHATASE, 
PLACENTAL; ALPP 




ALS2 


GDB.l 35696 


2q33-2q35 


AMYOTROPHIC LATERAL 
SCLEROSIS 2, JUVENILE- ALS2 


10 


APOB 


GDB:1 19686 


2p24-2p23 
2p24-2p24 


APOLIPOPROTEIN B; APOB 




BDE 


GDB:9955730 


2q37-2q37 


BRACHYDACTYLY, TYPE E; 
BDE 


15 


BDMR 


GDB:533064 


2q37-2q37 


BRACHYDACTYLY-MENTAL 
RETARDATION SYNDROME; 
BDMR 


BJS 


GDB-.9955717 


2q34-2q36 


TORTI AND NERVE 
DEAFNESS 


20 


BMPR2 


GDB:642243 


2q33-2q33 
2q33-2q34 


PULMONARY 

HYPERTENSION, PRIMARY; 
PPH1 BONE MORPHOGENETIC 
RECEPTOR TYPE II; BMPR2 


CHRNA1 


GDB: 120586 


2q24-2q32 


CHOLINERGIC RECEPTOR 
NICOTINIC, ALPHA 
POLYPEPTIDE 1; CHRNA1 


Li J 


CMCWTD 


GDB:1 1498919 


2p22.3-2p21 


FAMILIAL CHRONIC 
MUCOCUTANEOUS, 
DOMINANT TYPE 




CNGA3 


GDB:434398 


2qll.2-2qll.2 


COLORBLINDNESS, TOTAL 
CYCLIC NUCLEOTIDE GATED 
CHANNEL, OLFACTORY, 3; 
CNG3 


30 


COL3AI 


GDB: 118729 


2q31-2q32.3 
2q32.2-2q32.2 


COLLAGEN, TYPE HI; COL3A1 
EHLERS-DANLOS 
SYNDROME, TYPE IV, 
AUTOSOMAL DOMINANT 




COL4A3 


GDB.128351 


2q36-2q37 


COLLAGEN, TYPE IV, 
ALPHA-3 CHAIN; COL4A3 


35 


COL4A4 


GDB: 132673 


2q35-2q37 


COLLAGEN, TYPE IV, 
ALPHA-4 CHAIN; COL4A4 
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5 


COL6A3 


GDB: 119066 


2q37.3-2q37.3 


COLLAGEN, TYPE VI, 
ALPHA-3 CHAIN; COL6A3 
MYOPATHY, BENIGN 
CONGENITAL, WITH 
PONTR A PTT TP PC 




CPS1 


GDB: 11 9799 


2q33-2q36 
2q34-2q35 
2q35-2q35 


HYPERAMMONEMIA DUE TO 
CARBAMOYLPHOSPHATE 
SYNTHETASE I DEFICIENCY 


10 


CRYGA 


GDB:1 19076 


2q33-2q35 


CRYSTALLIN, GAMMA A; 
CRYGA 




CRYGEP1 


GDB: 119808 


2q33-2q35 


CRYSTALLIN, GAMMA A; 
CRYGA 


15 


CYP1B1 


GDB:353515 


2p21-2p21 
2p22-2p21 
2pter-2qter 


GLAUCOMA 3, PRIMARY 
INFANTILE, A; GLC3A 
CYTOCHROME P450, 
ST 1RFAMTT Y T 

(DIOXIN-INDUCIBLE), 
POLYPEPTIDE 1; CYP1B1 




CYP27A1 


GDB: 128 129 


2q33-2qter 


CEREBROTEND1NOUS 
XANTHOMATOSIS 


20 


DBI 


GDB: 119837 


2ql2-2q21 


DIAZEPAM BINDING 
INHIBITOR; DBI 




DES 


GDB:1 19841 


2q35-2q35 


DESMIN; DES 


25 


DYSF 


GDB:340831 


2p-2p 

2pl3-2pl3 

2pter-2Dl2 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2B; 
LGMD2B MUSCULAR 
DYSTROPHY, LATE-ONSET 
DISTAL 




EDAR 


GDB:9837372 


2qll-2ql3 


DYSPLASIA, HYPOHIDROTIC 
ECTODERMAL DYSPLASIA, 
ANHIDROTIC 


30 


EFEMP1 


GDB: 12201 11 


2pl6-2pl6 


DOYNE HONEYCOMB 
DEGENERATION OF RETINA 
FIBRILLIN-LIKE; FBNL 




EIF2AK3 


GDB:9956743 | 


2pl2-2pl2 


EPIPHYSEAL DYSPLASIA, 
MULTIPLE, WITH 
EARLY-ONSET DIABETES 
MELLITUS 



35 
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ERCC3 


GDB: 119881 


2q21-2q21 


EXCISION-REPAIR, 
COMPLEMENTING 
DFFFCTTVE IN CHINESE 
HAMSTER, 3; ERCC3 




FSHR 


GDB: 127510 


2p21-2pl6 


FOLLICLE-STIMULATING 
HORMOMF RFCEPTOR: FSHR 
GONADAL DYSGENESIS, XX 
TYPE 


10 


GAD1 


GDB: 119244 


2q31-2q31 


PYRIDOXINE DEPENDENCY 
WITH SEIZURES 




GINGF 


GDB:9848875 


2p21-2p21 


GINGIVAL SON OF 
SEVENLboo (DKUoUrHJLL/VJ 
HOMOLOGl;SOSl 


15 


GLC1B 


GDB:1297553 


2ql-2ql3 


GLAUCOMA 1, OPEN ANGLE, 
B; GLC1B 




GPD2 


GDB:354558 


zqZ4.l-zqz*f.i 


<TT VPT7ROT -1 -PHOSPHATE 
DEHYDROGENASE-2; GPD2 




GYPC 


GDB: 120027 


2ql4-2q21 


BLOOD GROUP— GERBICH; Ge 


20 


HADHA 


GDB:434026 


2p23-2p23 


HYDROXYACYL-CoA 

DEHYDROGENASE/3-KETOAC 

YL-CoA 

THIOLASE/ENOYL-CoA 


25 


HADHB 


GDB:344953 


ZpZJ-ZpZJ 


DEHYDROGENASE/3-KETOAC 
YL-CoA 

THIOLASE/ENOYL-CoA 
HYDRATASE, 




HOXD13 


GDB: 127225 


2q31-2q31 


HOMEO BOX-D13; HOXD13 
SYNDACTYLY, TYPE II 




HPE2 


GDB:136066 


2p21-2p21 


MIDLINE CLEFT SYNDROME 


30 


1GK.C 




9n 1"?-9r»19 
zp i z-z.p 1 z 

2pll.2-2pll.2 


IMMUNOGLOBULIN KAPPA 
CONSTANT REGION; IGKC 




EHH 


GDB:5 11203 


2q33-2q35 
2q35-2q35 
2pter-2qter 


BRACHYDACTYLY, TYPE Al; 
BDA1 INDIAN HEDGEHOG, 
DROSOPHILA, HOMOLOG OF; 
IHH 


35 


IRS1 


GDB:133974 


2q36-2q36 


INSULIN RECEPTOR 
SUBSTRATE 1; IRS1 
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ITGA6 Jc 


5DB.128027 h 


!pter-2qter U 


NTEGRIN, ALPHA-6; ITGA6 


KHK l< 


3DB:391903 Is 


> P 23.3-2p23.2 |l 


TtUCTOSURIA 


Ikynu |( 


3DB:9957925 \'< 


>q22.2-2q23.3 1 




LCT < 


3DB:120140 I: 


*q21-2q21 jl 


MSACCHARIDE 
INTOLERANCE H 


Ilhcgr I' 


GDB:125260 1 


2p21-2p21 1 


LUTEINIZING 

HORMONE/CHORIOGONADOT 
RHPlTsI RFCEPTOR: LHCGR 


Ilsfc I 


GDB.9956219 1 


2-2 2pl6-2pl6 j 


CYTOCHROME c OXIDASE 
DEFICIENCY, 

FHFlsTCH-CANADIAN TYPE 


1MSH2 I 


GDB.203983 1 


2pl6-2pl6 
2p22-2p21 | 


COLON CANCER, FAMILIAL, 
"WfYNJPOT YPOSIS TYPE 1; FCC1 


|mSH6 I 


GDB:632803 1 


2pl6-2pl6 


G/T MISMATCH-BINDING 

jr jS\j l nir>i ,vji di 


NEB 


GDB: 120224 


2q24.1-2q24.2 


NEBUL1N; NEB NEMALINE 
MYOPATHY 2, AUTOSOMAL 
RECESSIVE; NEM2 


Inmtc 


| GDB: 11498336 


|2q21-2q21 


1 THYROID CARCINOMA, 
PAPILLARY 


INPHPI 


GDB: 128050 


1 OnH 9n1 ^ 


1 NEPHRONOPHTHISIS, 

1 FAMILIAL JUVENILE 1 ; NPHP1 




JpAFAHlPl 


1 **** "*—^ T~% A *^ C f\f\t\ 

j GDB .435099 


1 9«1 1 9_?n1 1 2 


1 PLATELET-ACTF/ ATING 
FACTOR 

ACET YLHYDROL ASE, 
j GAMMA SUBUNIT 




|PAX3 


|GDB:120495 


|2q36-2q36 
2q35-2q35 


1 KLEIN- WAARDENBURG 
SYNDROME WAARDENBURG 
SYNDROME; WS1 




IPAX8 
0 | 


I GDB: 136447 


[2ql2-2ql4 


PAIRED BOX HOMEOTIC 
GENE8;PAX8 




Ipmsi 


IgDB:386403 


T2q31-2q33 


POSTMEIOTIC SEGREGATION 
INCREASED (S. 
|CEREVISIAE)-1; PMS1 




JPNKD 
5 j 


IgDB:5583973 


|2q33-2q35 


CHOREOATHETOSIS, 
FAMILIAL PAROXYSMAL; 
FPD1 
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PPH1 


GDB:1381541 


2q31-2q32 
2q33-2q33 


PULMONARY 

HYPERTENSION, PRIMARY; 
PPH1 




PROC 


GDB.120317 


2ql3-2q21 
2ql3-2ql4 


PROTEIN C DEFICIENCY, 
CONGENITAL THROMBOTIC 
DISEASE DUE TO 


10 


REG1A 


GDB: 132455 


2pl2-2pl2 


REGENERATING 
ISLET-DERIVED 1 -ALPHA; 
REG1A 


SAG 


GDB: 120365 


2q37.1-2q37.1 


S-ANTIGEN; SAG 




SFTPB 


GDB: 120374 


2pl2-2pll.2 


SURFACTANT-ASSOCIATED 
PROTEIN, PULMONARY-3 ; 
SFTP3 


15 


SLC11A1 


GDB:371444 


2q35-2q35 


CIRRHOSIS, PRIMARY; PBC 
NATURAL 

RESISTANCE-ASSOCIATED 
MACROPHAGE PROTEIN 1; 
NRAMP1 


20 


SLC3A1 


GDB.-202968 


2pl6.3-2pl6.3 
2p21-2p21 


SOLUTE CARRIER FAMILY 3, 
MEMBER 1;SLC3A1 
CYSTINURIA; CSNU 




SOS1 


GDB:230004 


2p22-2p21 


GINGIVAL SON OF 
SEVENLESS pROSOPHILA) 
HOMOLOG 1; SOS1 


25 


SPG4 


GDB:230127 


2p24-2p21 


SPASTIC PARAPLEGIA-4, 
AUTOSOMAL DOMINANT; 
SPG4 




SRD5A2 


GDB:127343 


2p23-2p23 


P S EUD O VAGINAL 
PERINEOSCROTAL 
HYPOSPADIAS; PPSH 


30 


TCL4 


GDB:136378 


2q34-2q34 


T-CELL 

LEUKEMIA/LYMPHOMA-4; 
TCL4 




TGFA 


GDB: 120435 


2pl3-2pl3 


TRANSFORMING GROWTH 
FACTOR, ALPHA; TGFA 




TMD 


GDB:9837196 


2q31-2q31 


TIBIAL MUSCULAR 
DYSTROPHY, TARDIVE 



35 
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TPO 


GDB: 120446 


2p25-2p25 
2p25-2p24 


THYROID 

HORMONOGENESIS, GENETIC 
DEFECT IN, IIA 


UGT1 


GDB: 120007 


2q37-2q37 


UDP 

GLUCURONOSYLTRANSFERA 
SE 1 FAMILY, Al; UGT1A1 


UV24 


GDB:9955737 


2pter-2qter 


UV-DAMAGE, EXCISION 
REPAIR OF, UV-24 


WSS 


GDB:9955707 


2q32-2q32 


WRINKLY SKIN SYNDROME; 
WSS 


XDH 


GDB:266386 


2p23-2p22 


XANTHINURIA 


ZAP70 


GDB:433738 


2qll-2ql3 
2ql2-2ql2 


SYK-RELATED TYROSINE 
KINASE; SRK 


ZFHX1B 


GDB:9958310 


2q22-2q22 


DISEASE, MICROCEPHALY, 
AND IRIS COLOBOMA 



10 



15 



Table 4: 



Genes, Locations and Genetic Disorders on Chromosome 3 



Gene 


GDB Accession ID 


Location 


OMIM Link 


ACAA1 


GDB: 119643 


3p23-3p22 


PEROXISOMAL 
3-OXOACYL-COENZYME A 
THIOLASE DEFICIENCY 


AGTR1 


GDB: 132359 


3q21-3q25 


ANGIOTENSIN II RECEPTOR, 
VASCULAR TYPE 1; AT2R1 


AHSG 


GDB: 118985 


3q27-3q27 


ALPHA-2-HS-GLYCOPROTEIN; 
AHSG 


AMT 


GDB: 132138 


3p21.3-3p21.2 
3p21.2-3p21.1 


HYPERGLYCINEMIA, ISOLATED 
NONKETOTIC, TYPE II; NKH2 


ARP 


GDB:9959049 


3p21.1-3p21.1 


ARGININE-RICH PROTEIN 


BBS3 


GDB:376501 


3p-3p 

3pl2.3-3qll.l 


BARDET-BIEDL SYNDROME, 
TYPE3;BBS3 


BCHE 


GDB: 120558 


3q26.1-3q26.2 


BUTYRYLCHOLINESTERASE; 
BCHE 


BCPM 


GDB:433809 j 


3q21-3q21 


BENIGN CHRONIC PEMPHIGUS; 
BCPM 


BTD 


GDB:309078 


3p25-3p25 


BIOTINIDASE; BTD 



20 



25 



30 
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10 



20 



30 



Gene 


GDB Accession ID 


Location 


OMIM Link 


CASR 


GDB:134196 


3q21-3q24 


HYPOCALCIURIC 
HYPERCALCEMIA, FAMILIAL; 
HHC1 


CCR2 


GDB:337364 


3p21-3p21 


CHEMOKINE (C-Cl RECEPTOR 2- 
CMKBR2 


CCR5 


GDB:1230510 


3d21-3d21 


CHEMOKINE fC-Q RECEPTOR 5- 
CMKBR5 


CDL1 


GDB: 13 6344 


3q26 3-3q26>3 


DE LANGE SYNDROME- CDL 


CMT2B 


GDB:604021 


3ql3-3q22 


CHARCOT-MARDE-TOOTH 
DISEASE, NEURONAL TYPE, B; 
CMT2B 


COL7A1 


GDB: 128750 


3p21-3p21 
3p21.3-3p21.3 


COLLAGEN, TYPE VII, ALPHA- 1 ; 
COL7A1 


CP 


GDB: 11 9069 


3q23-3q25 
3q21-3q24 


CERULOPLASMIN; CP 


CRV 


GDB: 11498333 


3p21.3-3p21.1 


VASCULOPATHY, RETINAL, 
WITH CEREBRAL 
LEUKODYSTROPHY 


CTNNB1 


GDB.141922 


3p22-3p22 
3p21.3-3p21.3 


CATENDSf BETA 1- CTNNB1 


DEM 


GDB:681157 


3ol2-3qll 


DEIVffiNTIA FAMILIAL 
NONSPECIFIC; DEM 


ETM1 


GDB:9732523 


3ql3-3ql3 


TREMOR, HEREDITARY 
ESSENTIA! 1 • FTM1 


FANCD2 


GDB:698345 


3p25.3-3p25.3 
3oter-3t>24 2 


FANCONI PANCYTOPENIA, 
POMPT FMFTsJT A TTOlsJ OR HTTP Fl 


FIH | 


GDB:9955790 


3ql3-3ql3 


HYPOPARATHYROIDISM, 
FAMILIAL ISOLATED- FIH 


FOXL2 


GDB: 129025 


3q23-3q23 
3q22-3q23 


BLEPHAROPHIMOSIS, 
EPICANTHUS INVERSUS, AND 
PTOSIS; BPES 


GBE1 


GDB:1 38442 


3pl2-3pl2 


GLYCOGEN STORAGE DISEASE 
IV 


GLB1 


GDB: 119987 


3p22-3p21.33 
3p21.33-3p21.33 


GANGLIOSIDOSIS, 
GENERALIZED GM1, TYPE I 
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Gene 


GDB Accession ID 


Location 


OMIM Link 




GLC1C 


GDB:3801941 


3q21-3q24 


GLAUCOMA 1, OPEN ANGLE, C; 
GLC1C 


5 


GNAI2 


GDB: 1205 16 


3p21.3-3p21.2 


GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-INHIBITING, 
POLYPEPTIDE-2; 


10 


GNAT1 


GDB: 119277 


3p21.3-3p21.2 


GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, 

ALPHA-TRANSDUCING, 
POLYPEPTIDE 




GP9 


GDB: 126370 


3pter-3qter 


PLATELET GLYCOPROTEIN EX; 
GP9 


15 


GPX1 


GDB: 119282 


3qll-3ql2 
3p21.3-3p21.3 


GLUTATHIONE PEROXIDASE- 
GPX1 


HGD 


GDB:203935 


3q21-3q23 


ALKAPTONURIA- AKU 




HRG 


GDB:120055 


3q27-3q27 


HISTIDINE-RICH 
GLYCOPROTEIN; HRG; HRGP 


20 


ITIH1 


GDB:120107 


3p21.2-3p21.1 


INTER-ALPHA-TRYPSIN 
INHIBITOR, HEAVY CHAIN-1; 
ITIH1 • IATIH- ITIH 

X ■* AX X J. y JU X X AA A- j & X XX X 




KNG 


GDB:125256 


3q27-3q27 


FLAUJEAC FACTOR 
DEFICIENCY 


25 


LPP 


GDB:1391795 


3q27-3q28 


LIM DOMAIN-CONTAINING 
PREFERRED TRANST OPATTON 
PARTNER IN LIPOMA; LPP 




LRS1 


GDB: 682448 j 


3d21 l-3t)14 1 


LARSEN SYNDROMF 
AUTOSOMAL DOMINANT; LRS1 




MCCC1 


GDB: 135989 


3q27-3q27 
3q25-3q27 


BETA-METHYLCROTONYLGLYC 
INURLAI 


30 


MDS1 


GDB:25041 1 


3q26-3q26 


MYELODYSPLASIA SYNDROME 
1;MDS1 




MHS4 


GDB:574245 


3ql3.1-3ql3.1 


HYPERTHERMIA 
SUSCEPTIBILITY-4; MHS4 


35 


MITF 


GDB:214776 


3pl4.1-3pl2 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MITF WAARDENBURG 
SYNDROME, TYPE II; WS2 
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Gene 


GDB Accession ID 


Location 


OIVDM Link 




MLH1 


GDB:2496l7 


3p23-3p22 
3p21.3-3p21.3 


COLON CANCER, FAMILIAL, 
NONPOLYPOSIS TYPE 2; FCC2 


5 


MYL3 


GDB: 1 202 18 


3p21.3-3p21.2 


MYOSIN, LIGHT CHAIN, ALKALI, 
VENTRICULAR AND SKELETAL 
SLOW; MYL3 j 




MYMY 


GDB: 11500610 


3p26-3p24.2 


DISEASE 




OPA1 


GDB: 118848 


3q28-3q29 


OPTIC ATROPHY 1; OPA1 


10 


PBXP1 


GDB: 125352 


3q22-3q23 


PRE-B-CELL LEUKEMIA 
TRANSCRIPTION FACTOR-1 ; 
PBX1 \ 




PCCB ! 


GDB: 119474 


3q21-3q22 


KjL, Y UiiNxiJVliA, isJi 1U1 1U, 11 


15 


POU1F1 


GDB:129070 


3pll-3pll 


POU DOMAIN, CLASS 1, 
lKAJNoCKlr 11UJN JrALlUK 1, 
POU1F1 




PPARG 


GDB:1223810 


3p25-3p25 


CANCER OF COLON 
PEROXISOME PROLBFERATOR 
AC 1 IV A 1 cAJ KnXJbr 1 UK, 
GAMMA; PPARG [ 


20 


PROSl 


GDB: 120721 


3pll-3qll 
3pll.l-3qll.2 


DE>fYTT3rM O AT DT-J A • DD f\Q1 




PTHRl 


GDB:138128 


3p22-3p21.1 


METAPHYSEAL 
CHONDRODYSPLASIA, MURK 
JANSEN TYPE PARATHYROID 
HORMONE RECEPTOR 1 ; PTHRl 


25 


RCAl 


GDB:230233 


3pl4.2-3pl4.2 


RENAL CARCINOMA, FAMILIAL, 
ASSOCIATED 1 ; RCAl 




RHO 


GDB: 120347 


3q21.3-3q24 


RHODOPSIN; RHO 




SCA7 


GDB:454471 


3p21.1-3pl2 


SPINOCEREBELLAR ATAXIA 7; 
SCA7 


30 


SCLCl 


GDB:9955750 


3p23-3p21 


SMALL-CELL CANCER OF THE 
LUNG; SCCL 




SCN5A 


GDB:132152 


3p21-3p21 


SODRJM CHANNEL, 
VOLTAGE-GATED, TYPE V, 
ALPHA POLYPEPTIDE; SCN5A 




SI 


GDB: 120377 


3q25.2-3q26.2 


DISACCHARIDE INTOLERANCE I 



35 
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Gene 


GDB Accession ID 


Location 


OMIM Link 




SLC25A20 


GDB:6503297 


3p21.31-3p21.31 


CARNITINE-ACYLCARNITINE 
TRANSLOCASE; CACT 


5 


SLC2A2 


GDB: 119995 


3q26.2-3q27 
3q26.1-3q26.3 


SOLUTE CARRIER FAMILY 2, 
MEMBER 2; SLC2A2 
FANCONI-BICKEL SYNDROME; 
FBS 




TF 


GDB: 120432 


3q21-3q21 


TRANSFERRIN; TF 


10 


TGFBR2 


GDB:224909 


3p22-3p22 
3pter-3p24.2 


TRANSFORMING GROWTH 
FACTOR-BETA RECEPTOR, 
TYPE H; TGFBR2 




THPO 


GDB:374007 


3q26.3-3q27 


THROMBOPOIETIN; THPO 




THRB 


GDB: 120731 


3p24.1-3p22 
3p24.3-3p24.3 


THYROID HORMONE 
RECEPTOR, BETA; THRB 


15 


TKT 


GDB: 132402 


3pl4.3-3pl4.3 


WERNICKE-KORSAKOFF 
SYNDROME 




TM4SF1 


GDB:250815 


3q21-3q25 


TUMOR-ASSOCIATED ANTIGEN 
L6; TAAL6 


20 


TRH 


GDB: 128072 


3pter-3qter 


THYROTROPIN-RELEASING 
HORMONE DEFICIENCY 


UMPS 


GDB: 120482 


3ql3-3ql3 


OROTICACEDURIA I 




UQCRC1 


GDB: 141 850 


3p21.3-3p21.2 
3p21.3-3p21.3 


UBIQUINOL-CYTOCHROME c 
REDUCTASE CORE PROTEIN I; 
UQCRC1 


25 


USH3A 


GDB:392645 


3q21-3q25 


USHER SYNDROME, TYPE III; 
USH3 




VHL 


GDB: 120488 


3p26-3p25 


VON HIPPEL-LINDAU 
SYNDROME; VHL 


30 


WS2A 


GDB: 128053 


3pl4.2-3pl3 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MITF WAARDENBURG 
SYNDROME, TYPE n; WS2 




XPC 


GDB: 134769 


3p25.1-3p25.1 


XERODERMA PIGMENTOSUM, 
COMPLEMENTATION GROUP C; 
XPC 


35 


ZNF35 


GDB:120507 


3p21-3p21 


ZINC FINGER PROTEIN-35; 
ZNF35 
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Table 5: 



Genes, Locations and Genetic Disorders on Chromosome 4 



Gene 


GDB Accession ID 


Location 


OMIM Link 


ADH1B 


GDB:1 19651 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-2; ADH2 


ADH1C 


GDB: 119652 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-3 ; ADH3 


AFP 


GDB: 119660 


4qll-4ql3 


ALPHA-FETOPROTEIN- AFP 


AGA 


GDB: 118981 


4q23-4q35 
4q32-4q33 


ASPARTYLGLUCOSAMINURIA 

a AK^A 4 * A • ^ * » A— f Vp/ V/ k^X AAT AAA ^ V-/ XX 1 X A 

; AGU 


AIH2 


GDB: 118751 


4qll-4ql3 
4ql3.3-4q21.2 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL 
AUTOSOMAL DOMINANT; 


ALB 


GDB: 118990 


4qll-4ql3 


ALBUMIN; ALB 


ASMD 


GDB: 11 9705 


4q-4q 4a28-4a3 1 


ANTERIOR SEGMENT OCULAR 
DYSGENESIS; ASOD 


BFHD 


GDB:1 1498907 


4q34.1-4q35 


DYSPLASIA, BEUKES TYPE 


CNGA1 


GDB: 127557 


4pl4-4ql3 


CYCLIC NUCLEOTIDE GATED 
CHANNEL, PHOTORECEPTOR, 
cGMP GATED, 1; CNCG1 


CRBM 


GDB:9958132 


4pl6.3-4pl6.3 


CHERUBISM 


DCK 


GDB:126810 


4ql3.3-4q21.1 


DEOXYCYTIDINE KINASE; 
DCK 


DFNA6 


GDB:636175 


4pl6.3-4pl6.3 


DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 6; DFNA6 


DSPP 


GDB:5560457 


4pter-4qter 
4q21.3-4q21.3 


DENTIN PHOSPHOPROTEIN; 
DPP DENTINOGENESIS 
IMPERFECTA; DGI1 


DTDP2 


GDB:9955810 


4q-4q 


DENTIN DYSPLASIA, TYPE II 


ELONG 


GDB:1 1498700 1 


4q24-4q24 j 




ENAM 


GDB:9955259 


4q21-4q21 \ 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL, 
AUTOSOMAL DOMINANT; 
AMELOGENESIS IMPERFECTA, 
HYPOPLASTIC TYPE 


ETFDH 


GDB: 135992 


4q32-4q35 


GLUTARICACIDURIA IIC; GA 

nc 
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2 





Gene 


GDB Accession ID 


Location 


OMIM Link 




EVC 


GDB:555573 


4pl6-4pl6 


ELLIS-VAN CREVELD 
SYNDROME; EVC 


5 


Fll 


GDB:1 19891 


4q35-4q35 


PTA DEFICIENCY 




FABP2 


GDB:1 19127 


4q28-4q31 


FATTY ACID BINDING 
PROTEIN 2, INTESTINAL; 
FABP2 


10 


FGA 


GDB:1 19129 


4q28-4q28 i 


AMYLOIDOSIS, FAMILIAL 
VISCERAL FIBRINOGEN, A 
ALPHA POLYPEPTIDE; FGA 




FGB 


GDB: 119130 


4q28-4q28 


FIBRINOGEN, B BETA 
POLYPEPTIDE; FGB 


15 


FGFR3 


GDB:127526 


4pl6.3-4pl6.3 


ACHONDROPLASIA; ACH 
BLADDER CANCER 
FIBROBLAST GROWTH 
FACTOR RECEPTOR-3; FGFR3 




FGG 


GDB:1 19132 


4q28-4q28 


FIBRINOGEN, G GAMMA 
POLYPEPTIDE; FGG 


20 


FSHMD1A 


GDB:1 19914 


4q35-4q35 


FACIOSCAPULOHUMERAL 
MUSCULAR DYSTROPHY 1 A; 
FSHMD1A 


GC 


GDB:1 19263 


4ql2-4ql3 
4ql2-4ql2 


GROUP-SPECIFIC 
COMPONENT; GC 




GNPTA 


GDB: 119280 


4q21-4q23 


MUCOLIPIDOSIS II; ML2; ML H 


25 


GNRHR 


GDB:136456 


4ql3-4ql3 
4q21.2-4q21.2 


GONAD OTROPIN-RELEASING 
HORMONE RECEPTOR; GNRHR 


GYPA 


GDB: 118890 


4q28-4q31 
4q28.2-4q31.1 


BLOOD GROUP-MN LOCUS; 
MN 




HCA 


GDB:9954675 


4q33-4qter 


HYPERCALCIURIA, FAMILIAL 
IDIOPATHIC 


30 


HCL2 


GDB:1 19305 


4q28-4q31 4q-4q 


HAIR COLOR-2; HCL2 


HD 


GDB:1 19307 


4pl6.3-4pl6.3 


HUNTINGTON DISEASE; HD 




HTN3 


GDB: 125601 


4ql2-4q21 


HISTATEN-3; HTN3 




HVBS6 


GDB: 120687 


4q32-4q32 


HEPATOCELLULAR 
CARCINOMA-2; HCC2 


35 


IDUA 


GDB:1 19327 


4pl6.3-4pl6.3 


MUCOPOLYSACCHARIDOSIS 
TYPE I; MPS I 
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Gene 


GDB Accession ID 


Location 


OMIM Link 


DP 


GDB: 120077 


4q24-4q25 
4q25-4q25 


COMPLEMENT COMPONENT-3 
DEACTIVATOR, DEFICIENCY 
OF 


JPD 


GDB: 1201 13 


4pter-4qter 
4ql2-4ql3 


PERIODONTITIS, JUVENILE; 
JPD 


KIT 


GDB:120117 


4ql2-4ql2 


V-KIT HARDY-ZUCKERMAN 4 
FELINE SARCOMA VIRAL 
ONCOGENE HOMOLOG; KIT 


KLKB1 


GDB: 127575 


4q34-4q35 1 
4q35-4q35 


FLETCHER FACTOR 
DEFICIENCY 


LQT4 


GDB:682072 


4q25-4q27 


SYNDROME WITHOUT 

PSYCHOMOTOR 

RETARDATION 


MANBA 


GDB: 125261 


4q21-4q25 


MANNOSIDOSIS, BETA; 
MANB1 


MLLT2 


GDB: 136792 


4q21-4q21 


MYELOID/LYMPHOID OR 
MIXED LINEAGE LEUKEMIA, 
TRANSLOCATED TO, 2; MLLT2 


MSX1 


GDB:120683 


4pl6.3-4pl6.1 
4ol6 l-4nl6.1 


MSH, DROSOPHELA, HOMEO 
BOX, HOMOLOG OF, 1; MSX1 


MTP 


GDB:228961 


4q24-4q24 


MICROSOMAL TRIGLYCERIDE 
TRANSFER PROTEIN, 88 KD; 
MTP 


NR3C2 


GDB: 1201 88 


4q31-4q31 
4q31.1-4q31.1 


PSEUDOHYPOALDOSTERONIS 
M, TYPE I, AUTOSOMAL 
RECESSIVE; PHA1 


PBT 


GDB:120260 


4ql2-4q21 


PIEBALD TRAIT; PBT 


PDE6B 


GDB:125915 


4pl6.3-4pl6.3 


NIGHTBLINDNESS, 
CONGENITAL STATIONARY; 
CSNB3 PHOSPHODIESTERASE 
6B, cGMP-SPECIFIC, ROD, 
BETA; PDE6B 


PEE1 


GDB:7016765 


4q31-4q34 
4q25-4qter 


1; PEE1 



35 
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Gene 


GDB Accession ID 


Location 


OMIM Link 


5 


PITX2 


GDB: 134770 


4q25-4q27 
4q25-4q26 
4q25-4q25 


IRIDOGOMODYSGENESIS, 
TYPE 2; IRID2 RIEGER 
SYNDROME, TYPE 1; RIEG1 
RIEG BICOID-RELATED 
HOMEOBOX TRANSCRIPTION 
FACTOR 1; RIEG1 HOMEO BOX 
2 




PKD2 


GDB: 11 8851 


4q21-4q23 


POLYCYSTIC KIDNEY 
DISEASE 2; PKD2 


10 


QDPR 


GDB:120331 


4pl5.3-4pl5.3 
4pl5.31-4pl5.31 


PHENYLKETONURIA II 




SGCB 


GDB:702072 


4ql2-4ql2 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2E; 
LGMD2E 




SLC25A4 


GDB: 119680 


4q35-4q35 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1; ANT1 
PROGRESSIVE EXTERNAL 
OPHTHALMOPLEGIA; PEO 


20 


SNCA 


GDB:439047 


4q21.3-4q22 
4q21-4q21 


SYNUCLEIN, ALPHA; SNCA 
PARKINSON DISEASE, 
FAMTT TAT TYPF t • PARTO 


SOD3 


GDB: 125291 


4pl6.3-4q21 


SUPEROXIDE DISMUTASE, 
FXTRACFT T TTT AR- *50m 




STATH 


GDB: 120391 


4qll-4ql3 


STATHERIN; STATH; STR 




TAPVR1 


GDB:392646 


4pl3-4qll 


ANOMALOUS PULMONARY 
VENOUS RETURN; APVR 


25 


TYS 


GDB: 119624 


4q-4q 


SCLEROTYLOSIS; TYS 




WBS2 


GDB:1 32426 


4q33-4q35.1 


WILLIAM S -BEUREN 
SYNDROME; WBS 


30 


WFS1 


GDB:434294 


4p-4p4pl6-4pl6 


DIABETES MELLITUS AND 
INSIPIDUS WITH OPTIC 
ATROPHY AND DEAFNESS 




WHCR 


GDB: 125355 


4pl6.3-4pl6.3 


WOLF-HIRSCHHORN 
SYNDROME; WHS 



35 
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Table 6: Genes, Locations and Genetic Disorders on Chromosome 5 



Gene 


GDB Accession ID 


OMHVf Link 


ADAMTS2 


GDB:9957209 


EHLERS-DANLOS SYNDROME, 
TYPE VH, AUTOSOMAL RECESSIVE 


ADRB2 


GDB: 120541 


BETA-2-ADRENERGIC RECEPTOR; 
ADRB2 


AMCN 


GDB:9836823 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA, NEUROGENIC TYPE 


AP3B1 


GDB:9955590 


HERMANSKY-PUDLAK 
SYNDROME; HPS 


APC 


GDB: 119682 ! 


ADENOMATOUS POLYPOSIS OF 
THE COLON; APC 


ARSB 


GDB: 119008 


MUCOPOLYSACCHARIDOSIS TYPE 
VI; MPS VI 


B4GALT7 


GDB:9957653 


SYNDROME, PROGEROID FORM 


BHR1 


GDB:9956078 


ASTHMA 


C6 


GDB: 119045 


COMPLEMENT COMPONENT-6, 
DEFICIENCY OF 


C7 


GDB:1 19046 


COMPLEMENT COMPONENT-7, 
DEFICIENCY OF 


CCAL2 


GDB:5584265 


CHONDROCALCINOSIS, FAMILIAL 
ARTICULAR 


CKN1 


GDB: 128586 


COCKAYNE SYNDROME, TYPE I; 
CKN1 


CMDJ 


GDB:9595425 


CRANIOMETAPHYSEAL 
DYSPLASIA, JACKSON TYPE; CMDJ 


CRHBP 


GDB: 127438 


CORTICOTROPIN RELEASING 
HORMONE-BINDING PROTEIN; 
CRHBP 


CSF1R 


GDB: 120600 


COLONY-STIMULATING FACTOR-1 
RECEPTOR; CSF1R 


DHFR 


GDB: 119845 


DIHYDROFOLATE REDUCTASE; 
DHFR 
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Gene 


GDB Accession ID 


OMIM Link 


DIAPH1 


GDB:9835482 


DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 1; DFNA1 

DIAPHANOUS, DROSOPHILA 
HOMOT Ofr OF 1 


DTR 


GDB: 11 9853 


DIPHTHERIA TOXIN SENSITIVITY; 
DTS 


EOS 


GDB:9956083 


EOSINOPHILS, FAMILIAL 


ERVR 


GDB:9835857 


HYALOIDEORETINAL 
DEGENERATION OF WAGNER 


F12 


GDB:1 19892 


HAGEMAN FACTOR DEFICIENCY 


FBN2 


GDB:128122 


CONTRACTURAL 
ARACHNODACTYLY, 
CONGENITAL; CCA 


GDNF 


GDB:450609 


GLIAL CELL LINE-DERIVED 


GHR 


GDB: 119984 


GROWTH HORMONE RECEPTOR; 
GHR 


GLRAI 


GDB-1 18801 


f*T VPTNTP RFPFPTOR AT PRA-1 

SUBUNIT; GLRAI KOK DISEASE 


GM2A 


GDB: 120000 


TA Y-SACHS DISEASE, AB VARIANT 


HFXR 


GDR-IIQ^OR 




HSD17R4 


vJ-L/J_>._) O JUJ7 


DEHYDROGENASE IV; HSD17B4 




vXLAD. 1 ZoUj J 


TXTTTTriPTXT AT PUA O* TTYT-A1 
UN 1 JtHjJtvllN, /\JLrJrrl/\-Z, 1 1 KjPlZ 


KFS 


GDB:9958987 


VERTEBRAL FUSION 


LGMD1A 


GDB:1 18832 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 1 A; LGMD1A 


LOX 


GDB:119367 


LYSYL OXIDASE; LOX j 


LTC4S 


GDB.-384080 


LEUKOTRIENE C4 SYNTHASE; 
LTC4S 


MAN2A1 


GDB.136413 


MANNOSIDASE, ALPHA, II; MANA2 
DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE II 
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GDB Accession ID 


OMIM Link 


MCC 


GDB:128163 


MUTATED IN COLORECTAL 
CANCERS; MCC 


MCCC2 ! 


GDB: 135990 


n 


MSH3 


GDB:641986 


MutS, E. COLI, HOMOLOG OF, 3; 
MSH3 


MSX2 


GDB: 138766 


MSH (DROSOPfflLA) HOMEO BOX 
HOMOLOG 2; MSX2 PARIETAL 
FORAMINA, SYMMETRIC; PFM 


NR3C1 


GDB:120017 


GLUCOCORTICOID RECEPTOR; GRL 


PCSK1 


GDB: 128033 


PROPROTEIN CONVERTASE 
SUBTILISIN/KEXIN TYPE 1; PCSK1 


PDE6A 


GDB:120265 


PHOSPHODIESTERASE 6A, 
cGMP-SPECIFIC, ROD, ALPHA; 
PDE6A 


PFBI 


GDB:9956096 


INTENSITY OF INFECTION IN 


RASA1 


GDB: 120339 


RAS p21 PROTEIN ACTIVATOR 1; 
RASA1 


SCZD1 


GDB: 120370 


DISORDER-l;SCZDl 


SDHA 


GDB:378037 


SUCCINATE DEHYDROGENASE 
COMPLEX, SUBUNIT A, 
FLAVOPROTEIN; SDHA 


SGCD 


GDB:5886421 


SARCOGLYCAN, DELTA; SGCD 


SLC22A5 


GDB:9863277 


CARNITINE DEFICIENCY, 
SYSTEMIC, DUE TO DEFECT IN 
RENAL REABSORPTION 


SLC26A2 


GDB:125421 


DIASTROPHIC DYSPLASIA; DTD 
EPIPHYSEAL DYSPLASIA, 
MULTIPLE; MED NEONATAL 
OSSEOUS DYSPLASIA I 
ACHONDROGENESIS, TYPE IB; 
ACG1B 


SLC6A3 


GDB:132445 


SOLUTE CARRIER FAMILY 6, 
MEMBER 3; SLC6A3 
DEFICIT-HYPERACnVITY 
DISORDER; ADHD 


SMI 


GDB:9834488 


SCHISTOSOMA MANSONI 
SUSCEPTIBILITY/RESISTANCE 
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Gene 


GDB Accession ID 


OMIM Link 


SMA@ 


GDB: 120378 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMN1 


SMN1 


GDB:5215173 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMN1 


SMN2 


GDB:5215175 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 2, CENTROMERIC; SMN2 


SPENK5 


GDB:9956114 


NETHERTON DISEASE 


TCOF1 


GDB: 127390 


TREACHER 

COLLINS-FRANCESCHETTI 
oYNDKUMb 1, 1LUH 


TGFBI 


GDB.597601 


CORNEAL DYSTROPHY, 
GRANULAR TYPE CORNEAL 
DYSTROPHY, LATTICE TYPE I; 
CDL1 TRANSFORMING GROWTH 
FACTOR, BETA-INDUCED, 68 KD; 
TGFBI 
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Gene 


GDB Accession ID 


OMIM Link 


ALDH5A1 


GDB:454767 ! 


SUCCINIC SEMIALDEHYDE 
DEHYDROGENASE, 
NAD(+)-DEPENDENT; SSADH 


ARG1 


GDB: 119006 


ARGININEMIA 


AS 


GDB: 135697 


ANKYLOSING SPONDYLITIS; AS 


ASSP2 


GDB: 119017 


CITRULLINEMIA 


BCKDHB 


GDB: 118759 


MAPLE SYRUP URINE DISEASE, TYPE 
IB 


BF 


GDB: 119726 


GLYCINE-RICH 
BETA-GLYCOPROTEIN; GBG 


C2 


GDB: 119731 


COMPLEMENT COMPONENT-2, 
DEFICIENCY OF 


C4A 


GDB: 119732 


COMPLEMENT COMPONENT 4A; C4A 


CDKN1A 


GDB-.266550 


CYCLDM-DEPENDENT KINASE 
INHIBITOR 1A; CDKN1A 
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Gene 


GDB Accession ID 


OMIM Link 




COL10A1 


GDB: 128635 


COLLAGEN, TYPE X, ALPHA 1; 
COL10A1 




POT 1 1 AO 
LUL1 1AZ 


UL/D. 1 17/65 


POT T AOFN TYPF XI AT PHA-2- 
COL1 1 A2 STICKLER SYNDROME, 

nn TT OTT ^ TAT? A 17VTTJPO 

TYPE II; STL2 DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
13; DFNA13 


10 


CYP21A2 


GDB: 120605 


ADRENAL HYPERPLASIA, 

CONGENITAL, DUE TO 

21 -HYDROXYLASE DEFICIENCY 




DYX2 


GDB:437584 


DYSLEXIA, SPECIFIC, 2; DYX2 




EJM1 


GDB: 119864 


MYOCLONIC EPILEPSY, JUVENILE; 
EJM1 


15 


ELOVL4 


GDB: 11499609 


STARGARDT DISEASE 3; STGD3 




EPM2A 


GDB-.3763331 


EPILEPSY, PROGRESSIVE 
MYOCLONIC 2; EPM2 




ESR1 


GDB:1 19120 


ESTROGEN RECEPTOR; ESR 


20 


EYA4 


GDB:700062 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
10;DFNA10 




F13A1 


GDB: 120614 


FACTOR XIII, Al SUBUNIT; F13A1 




FANCE 


GDB: 1220236 


FANCONI ANEMIA, 
COMPLEMENTATION GROUP E; FACE 


25 


GCLC 


GDB: 1329 15 


GAMMA-GLUTAMYLCYSTEINE 
SYNTHETASE DEFICIENCY, 
HEMOLYTIC ANEMIA DUE 




GJA1 


GDB: 125 196 


GAP JUNCTION PROTEIN, ALPHA-1, 
43 KD; GJA1 


30 


GLYS1 


GDB: 136421 


GLYCOSURIA, RENAL 


GMPR 


GDB: 127058 


GUANINE MONOPHOSPHATE 
REDUCTASE 




GSE 


GDB:9956235 


DISEASE; CD 


35 


HCR 


GDB:9993306 


PSORIASIS, SUSCEPTIBILITY TO 
HFEGDB: 119309 
HEMOCHROMATOSIS; HFE 
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GDB Accession ID 


OMEM Link 


HLA-A 


GDB: 119310 


HLA-A HISTOCOMPATIBILITY TYPE; 
HT A A HLA-DPB1GDB120636 LA-DP 
HISTOCOMPATIBILITY TYPE, BETA-1 
SUBUNIT ! 


HLA-DRA 


GDB:120641 


HLA-DR HISTOCOMPATIBILITY 
TYPE; HLA-DRA 


HPFH 


GDB:9849006 


HETEROCELLULAR HEREDITARY 
PERSISTENCE OF FETAL 
HEMOGLOBIN 


ICS1 


GDB: 136433 


IMMOTILb CILIA bYNUKUMI>l, ICoI 


IDDM1 ; 


GDB:9953173 


DIABETES MELLITUS, 
JUVENILE-ONSET 
INSULIN-DEPENDENT; IDDM 


IFNGR1 


GDB: 120688 


INTERFERON, GAMMA, RECEPTOR-1; 
IFNGR1 


IGAD1 


GDB:6929077 


SELECTIVE DEFICIENCY OF 


IGF2R 


GDB: 120083 


INSULIN-LIKE GROWTH FACTOR 2 
RECEPTOR; IGF2R 


ISCW 


GDB:9956158 


SUPPRESSION; IS 


LAMA2 


GDB:132362 


LAMININ, ALPHA 2; LAMA2 


LAP 


GDB:9958992 


LARYNGEAL ADDUCTOR 
PARALYSIS; LAP 


LCA5 


GDB: 11498764 


AMAUROSIS CONGENITA OF LEBER I 


LPA 


GDB: 120699 


APOLIPOPROTEIN(a); LPA 


MCDR1 


GDB: 13 1406 


MACULAR DYSTROPHY, RETINAL, 1, 
NORTH CAROLINA TYPE; MCDR1 


MOCS1 


GDB:9862235 


MOLYBDENUM COFACTOR 
LMir iOUtiN C Y 


MUT 


GDB: 120204 


METHYLMALONICACIDURIA DUE TO 
METHYLMALONIC CoA MUTASE 
DEFICIENCY 


MYB 


GDB:1 19441 


V-MYB AVIAN MYELOBLASTOSIS 
VIRAL ONCOGENE HOMOLOG; MYB 


NEU1 


GDB: 120230 


NEURAMINIDASE DEFICIENCY 
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GDB Accession ID 


OMIM Link 




NKS1 


GDB:128100 


SUSCEPTIBILITY TO LYSIS BY 
ALLOREACTTVE NATURAL KILLER 
CELLS; EC1 


5 


NYS2 


GDB:9848763 


NYSTAGMUS, CONGENTTAL 




OA3 


GDB: 136429 


ALBINISM, OCULAR, AUTOSOMAL 
RECESSIVE; OAR 


10 


ODDD 


GDB:6392584 


OCULODENTODIGITAL DYSPLASIA; 1 
ODDD 


OFC1 


GDB: 120247 { 


OROFACIAL CLEFT 1; OFC1 




PARK2 


GDB:6802742 


PARKINSONISM, JUVENILE 




PBCA 


GDB:9956321 


BETA CELL AGENESIS WITH 
NEONATAL DIABETES MELLITUS 


15 


PBCRA1 


GDB:3763333 


CHORIORETINAL ATROPHY, 
PROGRESSIVE BIFOCAL; CRAPB 




PDB1 


GDB: 136349 


DISEASE OF BONE; PDB 




PEX3 


GDB:9955507 


ZELLWEGER SYNDROME; ZS 


20 


PEX6 


GDB:5592414 


ZELLWEGER SYNDROME; ZS 
PEROXIN-6; PEX6 j 




PEX7 


GDB:6155803 


RHIZOMELIC CHONDRODYSPLASIA 
PUNCTATA; RCDP PEROXIN-7; PEX7 




PKHD1 


GDB:433910 


POLYCYSTIC KIDNEY AND HEPATIC 
DISEASE-1;PKHD1 


25 


PLA2G7 


GDB:9958829 


PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, SUBUNIT 




PLG 


GDB: 119498 


PLASMINOGEN; PLG 




POLH 


GDB:6963323 


PIGMENTOSUM WITH NORMAL DNA 
REPAIR RATES 


30 


PPAC 


GDB:9956248 


ARTHROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID, OF 
CHILDHOOD 




PSORS1 


GDB:6381310 


PSORIASIS, SUSCEPTIBILITY TO 




PUJO 


GDB:9956231 


MULTIC YSTIC RENAL DYSPLASIA, 
BILATERAL; MRD 


35 


RCD1 


GDB:333929 


RETINAL CONE DEGENERATION 
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Gene 


GDB Accession ID 


OMDVf Link 




RDS 


GDB. 118863 


T>t?TT\.TAT r»pnFMPP attom Qinw- 
RDS 


5 1 


RHAG 


GDB: 136011 


RHESUS BLOOD 
GKO U r- AboLM^IA 1 E.JLI 
GLYCOPROTEIN; RHAG RH-NULL, 
REGULATOR TYPE; RHN 




RP14 


GDB:433713 


RETINITIS PIGMENTOSA- 14; RP14 
TUBBY-LIKE PROTEIN 1; TULP1 


10 


RUNX2 


GDB:392082 


CLEIDOCRANIAL DYSPLASIA; CCD 
CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNIT 1; CBFA1 




RWS 


GDB:9956195 


SENSITIVITY 




SCA1 


GDB: 119588 ; 


SPINOCEREBELLAR ATAXIA 1; SCA1 


15 


SCZD3 


GDB:635974 


DISORDER-3; SCZD3 




SIASD 


GDB:433552 


SIALIC ACID STORAGE DISEASE; 
SIASD 




SOD2 


GDB: 119597 


SUPEROXIDE DISMUTASE 2, 
MITOCHONDRIAL; SOD2 


20 


ST8 


GDB:61 18456 


OVARIAN TUMOR 




TAP1 


GDB: 132668 


TRANSPORTER 1 , ABC; TAP 1 




TAP2 


GDB: 132669 


TRANSPORTER 2, ABC; TAP2 


25 


TFAP2B 


GDB:681506 


DUCTUS ARTERIOSUS; PDA 
TRANSCRIPTION FACTOR AP-2 BETA; 
TFAP2B 




TNDM 


GDB:9956265 


DIABETES-MELLITUS, TRANSIENT 
NEONATAL 




TNF 


GDB-.120441 


TUMOR NECROSIS FACTOR; TNF 


30 


TPBG 


GDB: 125568 


TROPHOBLAST GLYCOPROTEIN; 
TPBG; M6P1 




TPMT 


GDB:209025 


THIOPURINE 

S-METHYLTRANSFERASE; TPMT 




TULP1 


GDB:6199353 


TUBBY-LIKE PROTEIN 1; TULP1 


35 


WISP3 


GDB:9957361 


ARTHROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID, OF 
CHILDHOOD 
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Gene 


GDB Accession ID 


OMDVf Link 


5 


AASS 


GDB: 11 502144 


HYPERL Y S INEMIA 




ABCB1 


GDB: 120712 


P-GLYCOPROTEIN-1; PGY1 




ABCB4 


GDB:120713 


P-GLYCOPROTEIN-3; PGY3 


10 


ACHE 


GDB: 118746 


ACETYLCHOLINESTERASE BLOOD 
GROUP-Yt SYSTEM; YT 


AQP1 


GDB: 129082 


AQUAPORJN-1; AQP1 BLOOD 
GROUP-COLTON; CO 




ASL 


GDB: 119703 


ARGININOSUCCINICACIDURIA 




ASNS 


GDB: 11 9706 


ASPARAG1NE SYNTHETASE; ASNS; AS 


15 


AUTS1 


GDB:9864226 


DISORDER 




BPGM 


GDB: 119039 


DIPHOSPHOGLYCERATE ML) I Abb 
DEFICIENCY OF ERYTHROCYTE 




C7orf2 


GDB: 10794644 


ACHEIROPODY 


20 


CACNA2D1 


GDB:132010 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, L TYPE, 
ALP H A-2/DELT A SUBUNIT; 
MALIGNANT HYPERTHERMIA 
SUSCEPTIBILITY-3 




CCM1 


GDB:580824 


CEREBRAL CAVERNOUS 
MALFORMATIONS 1; CCM1 


25 


CD36 


GDB: 138800 


CD36 ANTIGEN; CD36 




CFTR 


GDB: 120584 


CYSTIC FIBROSIS; CF DEFERENS, 
CONGENITAL BILATERAL APLASIA 
OF; CBAVD; CAVD 




CHORDOMA 


GDB: 11498328 




30 


CLCN1 


GDB: 134688 


CHLORIDE CHANNEL 1, SKELETAL 
MUSCLE; CLCN1 




CMH6 


GDB-.9956392 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, WITH 
WOLFF-PARKINSON- WHITE 


35 


CMT2D 


GDB:9953232 


CHARCOT-MARIE-TOOTH DISEASE, 
NEURONAL TYPE, D 
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GDB Accession ID 


OMIMLink 


5 


C0L1A2 


GDB:1 19062 


COLLAGEN, TYPE I, ALPHA-2 
POLYPEPTIDE; COL1A2 
OSTEOGENESIS IMPERFECTA TYPE I 
OSTEOGENESIS IMPERFECTA TYPb 
IV;OI4 




CRS 


GDB: 119073 


CRANIOSYNOSTOSIS, TYPE 1; CRS1 




CYMD 


GDB:366594 


MACULAR EDEMA CYSTOID 


10 


DFNA5 


GDB:636174 


TVC A TTMT7CC A T ITHQ^yf A T FlOA/TTM A MT 

NONSYNDROMIC SENSORINEURAL, 5; 
DFNA5 




DLD 


GDB: 12060b 


T TOC\ A TV/fTPM? TYPI4VTVR OfrFKI A QT7 1 

DEFICIENCY, LACTIC ACIDOSIS DUE 
TO 


15 


DYT11 


GDB: 1001 3754 


MYOCLONUS, HEREDITARY ( 
ESSENTIAL 




EEC1 


GDB-.136338 


ECTRODACTYLY, ECTODERMAL 
DYSPLASIA, AND CLEFT LIP/PALATE; 
EEC 


20 


ELN 


GDB: 119107 


ELASTIN; ELN WILLIAMS-BEUREN 
SYNDROME; WBS 


ETV1 


GDB:335229 


ETS VARIANT GENE 1; ETV1 




FKBP6 


GDB:9955215 


WILLIAMS-BEUREN SYNDROME; WBS 


25 


GCK 


GDB: 127550 


DIABETES MELLITUS, AUTOSOMAL 
DOMINANT, TYPE II GLUCOKINASE; 
GCK i 


GHRHR 


GDB.138465 


GROWTH HORMONE-RELEASING 

HUKMUiNri KbCJir 1 <JK, ijrriKxLK 




GHS 


GDB:9956363 


MICROSOMIA WITH RADIAL DEFECTS 


30 


GLD 


GDB: 119990 


PALLISTER-HALL SYNDROME; PHS 

r±i t "K'PTTPPPT FA 1VTTT V MFMRFR 1* 

GLI3 POSTAXIAL POLYDACTYLY, 
TYPE Al GREIG 
CEPHALOPOLYSYNDACTYLY 
SYNDROME; GCPS 




GPDS1 


GDB:9956410 


GLAUCOMA, PIGMENT-DISPERSION 
TYPE 


35 


GUSB 


GDB: 120025 


MUCOPOLYSACCHARIDOSIS TYPE VII 
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X T A r\IT 

iADH 




TTYDROXYACYL-CoA 
DEHYDROGENASE/3-KETOACYL-CoA 
THIOLASE/ENOYL-CoA HYDRATASE, 


5 


HLXB9 




HftMFO ROY GENE HB9" HLXB9 
SACRAL AGENESIS, HEREDITARY, 
WITH FRboALKAL JVLAoo, AIN iCKJWJts. 
MENINGOCELE, 




H0XA13 


GDB:120656 


HOMEO BOX A13; HOXA13 


10 


HPFH2 


GDB:128071 


HEREDITARY rfc-KMo 1 JiJNUi Ur 
FETAL HEMOGLOBIN, 
HETEROCELLULAR, INDIAN 




HRX 


GDB:9958999 


HRX 




IAB 


GDB: 11498909 \ 


ANEURYSM, INTRACRANIAL BERRY 


15 


IMMP2L 


GDB:11499195 


GILLES DE LA TOURETTE 
SYNDROME; GTS 




KCNH2 


GDB:138126 


LONG QT SYNDROME, TYPE 2; LQT2 




LAMB1 


GDB: 11 9357 


LAMININ BETA 1; LAMB1 




LEP 


GDB: 136420 


LEPTIN; LEP 


20 


MET 


GDB: 120178 


MET PROTO-ONCOGENE; MET 




NCF1 


GDB: 120222 


GRANULOMATOUS DISEASE, 
CHRONIC, AUTOSOMAL 
CYTOCHROME-b-POSITIVE FORM 


25 


NM 


GDB: 11 9454 


NEUTROPHIL CHEMO 1 AL 1 1C 
RESPONSE; NCR 




OGDH 


GDB: 118847 


ALPHA-KETOGLUTARA 1 b 
DEHYDROGENASE DEFICIENCY 




OPN1SW 


GDB: 119032 


TRITANOPIA 


30 


PEX1 


GDB:9787110 


/jH/LjLj W I^LtJCJln. o I INJL^J\.VylVlIZ» 5 Z-»C> 

PEROXIN-l;PEXl 




PGAM2 


GDB: 120280 


PHOSPHOGLYCERATE MUTASE, 
DEFICIENCY OF M SUBUNIT OF 




PMS2 


GDB:386406 


POSTMEIOTIC SEGREGATION 
INCREASED (S. CEREVISIAE)-2; PMS2 


35 


PON1 


GDB:120308 


PARAOXONASE 1; PON1 
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OMIM Link 




PPP1R3A 


GDB: 136797 


PROTEIN PHOSPHATASE 1, 

n-n/^Tn a two \r /TKTtjrminr<^o\ ct tot fKI r 1 1 
REGULAIOKY ^INrllDl 1 UK) dUBUXNII 

3; PPP1R3 


5 


PRSS1 


GDB: 119620 


PANCREATITIS, HEREDITARY; PCTT 
PROTEASE, SERINE, 1; PRSS1 




PTC 


GDB:1 18744 


PHENYLTHIOCARB AMIDE TASTING 


10 


PTPN12 


GDB: 136846 


PROTE1N-TYROSINE PHOSPHATASE, 
NONRECEPTOR TYPE, 12;PTPN12 


RP10 


GDB: 138786 


RETINITIS PIGMENTOSA-10; RP10 




RP9 


GDB:333931 


RETINITIS PIGMENTOSA-y; Krv 




SERPINE1 


GDB: 120297 


PLASMINOGEN ACTIVATOR 
INHIBITOR, TYPE I; PAH 


15 


SGCE 


GDB:9958714 


MYOCLONUS, HEREDITARY 
ESSENTIAL 




SHFM1 


GDB:128195 


opt TT MAMD/FrjOT F)FFOR MTT Y TYPE 
I; SHFD1 


20 


SHH 


GDB:456309 


HOLOPROSENCEPHALY, TYPE 3; HPE3 
HOMOLOG OF; SHH 




SLC26A3 


GDB:138165 


DOWN-REGULATED IN ADENOMA; 
DRA CHLORIDE DIARRHEA, 
FAMILIAL; CLD 


25 


SLC26A4 


GDB:5584511 


PENDRED SYNDROME; PDS 
DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 4; DFNB4 




SLOS 


GDB:385950 


SMITH-LEMLI-OPITZ SYNDROME 




SMAD1 


GDB:3763345 


CDTXT A T A/Tf TQ/^T TT AP A TP fYPT-IV 

DISTAL, WITH UPPER LIMB 


30 


TBXAS1 


GDB: 128744 


THROMBOXANE A SYNTHASE 1; 
TBXAS1 




TWIST 


GDB: 135694 


ACROCEPHALOSYNDACTYLY TYPE 
m TWIST, DROSOPHILA, HOMOLOG 
OF; TWIST 




ZWS1 


GDB: 1205 11 


ZELLWEGER SYNDROME; ZS 
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Table 9: Genes, Locations and Genetic Disorders on Chromosome 8 



Gene 


GDB AccessionED 


OMIM Link 


ACHM3 


GDB:9 120558 


PINGELAPESE BLINDNESS 


ADRB3 


GDB:203869 


BETA-3-ADRENERGIC RECEPTOR; ADRB3 


ANK1 


GDB: 11 8737 


SPHEROCYTOSIS, HEREDITARY; HS 


CA1 


GDB: 119047 


CARBONIC ANHYDRASE I, ERYTHROCYTE, 
ELECTROPHORETIC VARIANTS OF; CA1 


CA2 


GDB: 119739 


OSTEOPETROSIS WITH RENAL TUBULAR 
ACIDOSIS 


CCAL1 


GDB:5 12892 


CHONDROCALCINOSIS WITH 
EARLY-ONSET OSTEOARTHRITIS; CCAL2 


CLN8 


GDB:252118 


EPILEPSY, PROGRESSIVE, WITH MENTAL 
RETARDATION; EPMR 


CMT4A 


GDB: 138755 


CHARCOT-MARIE-TOOTH NEUROPATHY 
4A; CMT4A 


CNGB3 


GDB:9993286 


PINGELAPESE BLINDNESS 


COH1 


GDB-.252122 


COHEN SYNDROME; COH1 


CPP 


GDB: 119798 


CERULOPLASMIN; CP 


CRH 


GDB: 119804 


CORTICOTROPIN-RELEASING HORMONE; 
CRH 


CYP11B1 


GDB: 120603 


ADRENAL HYPERPLASIA, CONGENITAL, 
DUE TO 1 1 -@BETA-HYDROXYLASE 
DEFICIENCY 


CYP11B2 


GDB: 1205 14 


CYTOCHROME P450, SUBFAMILY XIB, 
POLYPEPTIDE 2; CYP1 1B2 


DECR1 


GDB:453934 


2,4-@DIENOYL-CoA REDUCTASE; DECR 


DPYS 


GDB:5885803 


DIHYDROPYRIMIDINASE; DPYS 


DURS1 


GDB:9958126 


DUANE SYNDROME 


EBS1 


GDB: 119856 


EPIDERMOLYSIS BULLOSA SIMPLEX, 
OGNA TYPE 


ECA1 


GDB: 107963 18 


JUVENILE ABSENCE 


EGI 


GDB: 128830 


EPILEPSY, GENERALIZED, IDIOPATHIC; 
EGI 



10 



15 



20 



25 



30 



35 
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Gene 


GDB AccessionlD 


OMIM Link 




EXT1 


GDB:135994 


EXOSTOSES, MULTIPLE, TYPE I; EXT1 
CHONDROSARCOMA 


5 


EYA1 


GDB:5215167 


BRANCfflOOTORENAL DYSPLASIA EYES 
ABSENT 1; EYA1 




FGFR1 


GDB: 119913 


ACROCEPHALOSYNDACTYLY 1 YFfc. V 
FIBROBLAST GROWTH FACTOR 
RECEPTOR- 1 ; FGFR1 


10 


GNRH1 


GDB:133746 


GONADOTROPIN-RELEASINGHORMUNb 1; 
GNRH1 FAMILIAL HYPOGONADOTROPHIC 




GSR 


GDB: 11 9288 


GLUTATHIONE REDUCTASE; GSR 




GULOP 


GDB: 128 078 


otUKV I 


15 


HR 


GDB: 595499 


at npcpTA TrMTVT?pCAT T^I ATRTPT4TA WTTH 

PAPULAR LESIONS HAIRLESS, MOUSE, 
HOMOLOG OF 




KCNQ3 


GDB:9787230 


CONVULSIONS, BENIGN FAMILIAL 
NEONATAL, TYPE 2; BFNC2 POTASSIUM 
CHANNEL, VOLTAGE-GATED, SUBFAMILY 
Q, MEMBER 3 


20 


KFM 


GDB:265291 


KLIPPEL-FEIL SYNDROME; KFS; KFM 


KWE 


GDB:9315120 


KERATOLYTIC WINTER ERYTHEMA 




LGCR 


GDB: 120698 


LANGER-GEEDION SYNDROME; LGS 




LPL 


GDB: 120700 


HYPERLIPOPROTEINEMIA, TYPE I 




MCPH1 


GDB:9834525 


MICROCEPHALY; MCT 


25 


MOS 


GDB: 119396 


TRANSFORMATION GENE: ONCOGENE 
MOS 




MYC 


GDB: 120208 


TRANSFORMATION GENE: ONCOGENE 
MYC; MYC 




NAT1 


GDB-.125364 


ARYL AMIDE ACETYLASE 1 ; AAC1 


30 


NAT2 


GDB:125365 


ISONIAZID DEACTIVATION 




NBS1 


GDB:9598211 


NIJMEGEN BREAKAGE SYNDROME 




PLAT 


GDB: 11 9496 


PLASMINOGEN ACTIVATOR, TISSUE; PLAT 


35 


PLEC1 


GDB:41 19073 


EPIDERMOLYSIS BULLOSA SIMPLEX AND 
LIMB-GIRDLE MUSCULAR DYSTROPHY 
PLECTIN 1;PLEC1 
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Gene 


GDB AccessionID 


OMIMLink 


PRKDC 


GDB:234702 


SEVERE COMBINED IMMUNODEFICIENCY 
nTSFASE-1- SCID1 PROTEIN KINASE, 
DNA-ACTIVATED, CATALYTIC SUBUNIT; 
PRKDC 


PXMP3 


GDB:131487 


PEROXIN-2; PEX2 ZELLWEGER 
SYNDROMb; 


RP1 


GDB: 120352 


RETINITIS PIGMENTOSA-1; RP1 


SCZD6 


GDB: 9864736 


DISORDER-2; SCZD2 


SFTPC 


UDB:12U3 /J 


pyyT MONARY SURFACTANT APOPROTEIN 
PSP-C 


SGM1 


GDB:135350 


KLIPPEL-FEIL SYNDROME; KFS; KFM 


SPG5A 




«5PA<?TTC PARAPLEGIA-5A AUTOSOMAL 
RECESSIVE; SPG5A i 


STAR 


GDB:635457 


STEROIDOGENIC ACUTE REGULATORY 

PPOTFTW* STAT? 


TG 


GDB: 120434 


THYROGLOBULIN; TG 


TRPS1 


GDB:594960 


TRICHORHINOPHALANGEAL SYNDROME, 
TYPE I; TRPS1 


TTPA 


GDB:512364 


VITAMIN E, FAMILIAL ISOLATED 
DEFICIENCY OF; VED TOCOPHEROL 
(ALPHA) TRANSFER PROTEIN; TTPA 


VMD1 


GDB: 11 9631 


MACULAR DYSTROPHY, ATYPICAL 
VTTELLIFORM; VMD1 


WRN 


GDB: 128446 


WERNER SYNDROME; WRN 



Table 10: Genes, Locations and Genetic Disorders on Chromosome 9 



Gene 


GDB AccessionID 


OMIM Link 


ABCA1 


GDB:305294 


ANALPHALEPOPROTEINEMIA 
ATP-BINDING CASSETTE 1; ABC1 


ABL1 


GDB: 11 9640 


ABELSON MURINE LEUKEMIA VIRAL 
ONCOGENE HOMOLOG 1; ABL1 


ABO 


GDB:1 18956 


ABO BLOOD GROUP; ABO 


ADAMTS13 


GDB:9956467 


THROMBOCYTOPENIC PURPURA 


AK1 


GDB: 119664 


ADENYLATE KINASE-1; AK1 
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Gene 


GDB AccessionID 


OMIM Link 




ALAD 


GDB: 119665 


DELTA-AMINOLEVULINATE 
DEHYDRATASE; ALAD 


5 


ALDH1A1 


GDB: 119667 


ALDEHYDE DEHYDROGENASE- 1; ALDH1 




ALDOB 


GDB: 119669 


FRUCTOSE INTOLERANCE, HEREDITARY 




AMBP 


GDB:120696 


PROTEIN HC; HCP 


10 


AMCD1 


GDB:437519 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA, DISTAL, TYPE 1; AMCD1 


ASS 


GDB: 119010 


CITRULLINEMIA 




BDMF 


GDB:9954424 


BONE DYSPLASIA WITH MEDULLARY 
FIBROSARCOMA 




BSCL 


GDB:9957720 


SEIP SYNDROME 


15 


C5 


GDB: 119734 


COMPLEMENT COMPONENT-5, 
DEFICIENCY OF 




CDKN2A 


GDB:335362 


MELANOMA, CUTANEOUS MALIGNANT, 2; 
CMM2 CYCLIN-DEPENDENT KINASE 
INHIBITOR 2A; CDKN2A 


20 


CHAC 


GDB:6268491 


CHOREOACANTHOCYTOSIS; CHAC 


CHH 


GDB: 138268 


CARTILAGE-HAIR HYPOPLASIA; CHH 




CMD1B 


GDB:677147 


CARDIOMYOPATHY, DILATED IB; CMD1B 




COL5A1 


GDB: 13 1457 


COLLAGEN, TYPE V, ALPHA- 1 
POLYPEPTIDE; COL5A1 


25 


CRAT 


GDB:359759 


CARNITINE ACETYLTRANSFERASE; CRAT 




DBH 


GDB:1 19836 


DOPAMINE BETA-HYDROXYLASE, 
PLASMA; DBH 




DFNB11 


GDB: 12201 80 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSrVE, 7; DFNB7 


30 


DFNB7 


GDB:636178 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 7; DFNB7 




DNAI1 


GDB: 11 500297 


IMMOTEJS CILIA SYNDROME-1; ICS1 




DYS 


GDB: 137085 


DYSAUTONOMIA, FAMILIAL; DYS 




DYT2 


GDB: 119854 


DYSTONIA 1, TORSION; DYT1 


35 


ENG 


GDB: 137193 


ENDOGLIN; ENG 
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Gene 


GDB AccessionlD 


OMIM Link 




EPB72 


GDB: 128993 


ERYTHROCYTE SURFACE rROIhJUN BAND 
7.2; EPB72 STOMATOCYTOSIS I 


5 


FANCC 


GDB: 132672 


FANCONI ANEMIA, COMPLEMENTATION 
GROUP C; FACC 




FBP1 


GDB:141539 


FRUCTOSE-l,6-BISPHOPHATASE 1;FBP1 




FCMD 


GDB:250412 


FUKUYAMA-TYPE CONGENITAL 
MUSCULAR DYSTROPHY; FCMD 


10 


FRDA 


GDB:1 19951 


FRIEDREICH ATAXIA 1; FRDA1 




GALT 


GDB: 11 9971 


GALACTOSEMIA 




GLDC 


GDB: 128611 


HYPERGLYCINEMIA, ISOLATED 
NONKETOTIC, TYPE I; NKH1 




GNE 


GDB:9954891 


INCLUSION BODY MYOPATHY; IBM2 


15 


GSM1 


GDB:9784210 


GENIOSPASM 1; GSM1 




GSN 


GDB: 120019 


AMYLOIDOSIS V GELSOLIN; GSN 




HSD17B3 


GDB:347487 


PSEUDOHERMAPHRODITISM, MALE, WITH 
GYNECOMASTIA 


20 


HSN1 


GDB:3853677 


NEUROPATHY, HEREDITARY SENSORY, 
TYPE 1 | 




IBM2 


GDB:3801447 


INCLUSION BODY MYOPATHY; IBM2 




LALL 


GDB:9954426 


LEUKEMIA, ACUTE, WITH 
LYMPHOMATOUS FEATURES; LALL 




LCCS 


GDB:386141 


LETHAL CONGENITAL CONTRACTURE 
SYNDROME; LCCS 




LGMD2H 


GDB:9862233 


DYSTROPHY, HUTTERITE TYPE 




LMX1B 


GDB:9834526 


NAIL-PATELLA SYNDROME; NPS1 


30 


MLLT3 


GDB: 13 8172 


MYELOID/LYMPHOID OR MIXED LINEAGE 
LEUKEMIA, TRANSLOCATED TO, 3; MLLT3 


MROS 


GDB:9954430 


MELKERSSON SYNDROME 




MSSE 


GDB:128019 


EPITHELIOMA, SELF-HEALING SQUAMOUS 




NOTCH1 


GDB:131400 


NOTCH, DROSOPHILA, HOMOLOG OF, 1; 
NOTCH1 


35 


ORM1 


GDB: 120250 


OROSOMUCOID 1; ORM1 
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Gene 


GDB Accession© 


OMIM Link 




PAPPA 


GDB:134729 


PREGNANCY-ASSOCIATED PLASMA 
PROTEIN A; PAPPA 


5 


PIP5K1B 


GDB:686238 


. . i c _ — it t a rp a \rr a 1 l'<»T> A A 

FRIEDREICH ATAXIA 1; FRDA1 




PTCH 


GDB: 11 9447 


BASAL CELL NEVUS SYNDROME; BCNS 
PATCHED, DROSOPHILA, HOMOLOG OF; 
PTCH 


10 


PTGS1 


GDB: 128070 


PROSTAGLANDIN-ENDOPEROXIDASE 
SYNTHASE 1; PTGS1 


RLN1 


GDB:1 19552 


RELAXIN; RLN1 




RLN2 


GDB: 119553 


RELAXIN, OVARIAN, OF PREGNANCY 


15 


RMRP 


GDB: 120348 


MITOCHONDRIAL RNA-PROCESSING 
ENDORIBONUCLEASE, RNA COMPONENT 

HYPOPLASIA; CHH 




ROR2 


GDB: 136454 


BRACHYDACTYLY, TYPE B; BDB 
ROBINOW SYNDROME, RECESSIVE FORM 
NEUROTROPHIC TYROSINE KINASE, 
RECEPTOR-RELATED 2; NTRKR2 


20 


RPD1 


GDB.9954440 


RETINITIS PIGMENTOSA-DEAFNESS 
SYNDROME 1, AUTOSOMAL DOMINANT 




SARDH 


GDB:9835149 


SARCOSINEMIA 




TDFA 


GDB:9954420 


FACTOR, AUTOSOMAL 


25 


TEK 


GDB:344185 


\ TT7KTr\T TO N/1 AT T7^T>A/1 A TTHMC \A1 IT TTPT TJ 

CUTANEOUS AND MUCOSAL; VMCM TEK 
TYROSINE KINASE, ENDOTHELIAL; TEK 




TSC1 


GDB:120735 


TUBEROUS SCLEROSIS-1; TSC1 




TYRP1 


GDB:126337 


TYROSINASE-RELATED PROTEIN 1; TYRP1 
ALBINISM III XANTHISM 




XPA 


GDB: 125363 


XERODERMA PIGMENTOSUM I 



Table 1 1 : Genes, Locations and Genetic Disorders on Chromosome 10 





Gene 


GDB Accession ID 


OMIM Link 




CACNB2 


GDB: 132014 


CALCRJM CHANNEL, 








VOLTAGE-DEPENDENT, BETA-2 


35 






SUBUNIT; CACNB2 
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Gene 


GDB Accession ID 


OMIMLink 




COL17A1 


GDB: 131396 


COLLAGEN, TYPE XVH, ALPHA- 1 
POLYPEPTIDE; COL17A1 


5 


CUBN 


GDB:636049 


MEGALOBLASTIC ANEMIA 1; MGA1 




CYP17 


/ij>n .11 noon 


ATYDT7MAT T-FVPPPTH ACTA 

CONGENITAL, DUE TO 

1 7- ALPHA-HYDROX YL ASE ! 

DEFICIENCY 


10 


CYP2C19 


GDB:1 19831 


CYTOCHROME P450, SUBFAMILY 
IIC, POLYPEPTIDE 19; CYP2C19 




CYP2C9 


GDB:131455 


CYTOCHROME P450, SUBFAMILY 
EC, POLYPEPTIDE 9; CYP2C9 




EGR2 


GDB: 120611 


EARLY GROWTH RESPONSE-2; 
EGR2 


15 


EMX2 


GDB:277886 


EMPTY SPIRACLES, DROSOPHRA, 
2, HOMOLOG OF; EMX2 




EPT 


GDB:9786112 


TJDTT T7DOV 'DACTTAT • EDT 

brlJLh.ro Y , rAK 1 1AJL, rlr 1 


20 


ERCC6 


/rrx-n . 1 i noon 

GDBtl 19882 


iiXt_.loJ.UJN -KbJr A1K. 
CROSS-COMPLEMENTING RODENT 
REPAIR DEFICIENCY, 
COMPLEMENTATION 




FGFR2 


GDB: 127273 


ACROCEPHALOSYNDACTYLY 
TYPE V FIBROBLAST GROWTH 
FACTOR RECEPTOR-2; FGFR2 




HK1 


GDB: 120044 


HEXOKINASE-1; HK1 


25 


HOX11 


GDB: 119607 


HOMEO BOX-1 1; HOX1 1 




HPS 


GDB: 127359 


HERMANSKY-PUDLAK 
SYNDROME; HPS 




IL2RA 


GDB: 119345 


INTERLEUKIN-2 RECEPTOR, 
ALPHA; IL2RA 


30 


LGI1 


GDB:9864936 


EPILEPSY, PARTIAL; EPT 




LIPA 


GDB:120153 


WOLMAN DISEASE 




MAT1A 


GDB: 129077 


METHIONINE 

ADENOSYLTRANSFERASE 

DEFICIENCY 


35 


MBL2 


GDB.120167 


MANNOSE-BINDING PROTEIN, 
SERUM; MBP1 
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Gene 


GDB Accession ID 


OMJM Link 




MKI67 


GDB:120185 


PROLIFERATION-RELATED Ki-67 
ANTIGEN; MKI67 


5 


MXI1 


GDB: 1371 82 


MAX INTERACTING PROTEIN 1 ; 
MXI1 




OAT 


GDB: 120246 


ORNITHINE AMINOTRANSFERASE 
DEFICIENCY 


10 


OATL3 


GDB:215803 


ORNITHINE AMINOTRANSFERASE ' 
DEFICIENCY 


PAX2 


GDB: 138771 


PAIRED BOX HOMEOTIC GENE 2; 
PAX2 




PCBD . 


(jDd:13o4/o 


DEHYDRATASE; PCBD 
PRIMAPTERJNURIA 


15 


PEOl 


GDB:632784 


PROGRESSIVE EXTERNAL 
OPHTHALMOPLEGIA; PEO 




PHYH 


GDB:9263423 


REFSUM DISEASE 
PHYTANOYL-CoA HYDROXYLASE; 
PHYH 


20 


PNLIP 


GDB:127916 


LIPASE, CONGENITAL ABSENCE OF 
PANCREATIC 




PSAP 


GDB: 120366 


PROSAPOSIN; PSAP 


25 


PTPTsJ 


GDR-6022948 


MACROCEPHALY MULTIPLE 
LIPOMAS AND HEMANGIOMATA 
MULTIPLE HAMARTOMA 
SYNDROME; MHAM POLYPOSIS, 
J U VlilNLLli 1IN 1 Jio 1 JJNAJL 
PHOSPHATASE AND TENSIN 
HOMOLOG; PTEN 




RBP4 


GDB: 120342 


RETINOL-BINDING PROTEIN, 
PLASMA; RBP4 


30 


RDPA 


GDB:9954445 


REFSUM DISEASE WITH 
INCREASED PIPECOLICACIDEMIA; 
RDPA 




RET 


GDB: 120346 


RET PROTO-ONCOGENE; RET 




SDF1 


GDB:433267 


STROMAL CELL-DERIVED FACTOR 
1;SDF1 



35 
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GDB Accession ID 


OMEM Link 


SFTPA1 


GDB:1 19593 


PULMONARY SURFACTANT 
APOPROTEIN PSP-A; PSAP 


SFTPD 


GDB: 132674 


PULMONARY SURFACTANT 
APOPROTEIN PSP-D; PSP-D 


SHFM3 


GDB:386030 


SPLIT-HAND/FOOT 

MAT TTOP \A A TTfYW TVPT7 7« CT-TP\>fl 


SIAL 


GDB.6549924 


NEURAMINIDASE DEFICIENCY 


THC2 


GDB: 10794765 


THROMBOCYTOPENIA 


TNFRSF6 


GDB: 132671 


APOPTOSIS ANTIGEN 1 ; APT1 j 


UFS 


GDB:6380714 


UROFACIAL SYNDROME; UFS 


UROS 


GDB:128112 


PORPHYRIA, CONGENITAL 
ERYTHROPOIETIC; CEP 



Table 12: Genes, Locations and Genetic Disorders on Chromosome 1 1 



Gene 


GDB Accession ID 


OMIM Link 


AA 


GDB:568984 


ATROPHIA AREATA; AA 


ABCC8 


GDB:591370 


SULFONYLUREA RECEPTOR; SUR 
PERSISTENT HYPERINSULINEM1C 
HYPOGLYCEMIA OF INFANCY 


ACATl 


GDB: 126861 


ALPHA-METHYLACETOACETICACIDURIA 


ALX4 


GDB: 10450304 


PARIETAL FORAMINA, SYMMETRIC; PFM 


AMPD3 


GDB:136013 


ADENOSINE MONOPHOSPHATE 
DE AMIN ASE-3 ; AMPD3 


ANC 


GDB:9954484 


CANAL CARCINOMA 


APOA1 


GDB: 119684 


AMYLOIDOSIS, FAMILIAL VISCERAL 
APOLIPOPROTEIN A-I OF HIGH DENSITY 
LIPOPROTEIN; APOA1 


APOA4 


GDB: 119000 


APOLIPOPROTEIN A-P7; APOA4 


APOC3 


GDB: 119001 


APOLIPOPROTEIN C-ffl; APOC3 


ATM 


GDB:593364 


ATAXIA-TELANGIECTASIA; AT j 


BSCL2 


GDB:9963996 


SEIP SYNDROME 


BWS 


GDB: 120567 


BECKWITH- WIEDEMANN SYNDROME; 
BWS 
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Gene 


GDB Accession ID 


OMIM Link 




CALCA 


GDB:120571 


C ALCITON IJN/C AlA-,1 1 UiNJJN-KGL.A 1 JSD 

POLYPEPTIDE, ALPHA; CALCA 


5 


CAT 


GDB:1 19049 


CATALASE; CAT 




CCND1 


GDB: 128222 


LEUKEMIA, CHRONIC LYMPHATIC; CLL 
CYCLIN Dl; CCND1 




CD3E 


GDB:1 19764 


CD3E ANTIGEN, EPSILON POLYPEPTIDE; 
CD3E 


10 


CD3G 


GDB: 11 9765 


T3 T-CELL ANTIGEN, GAMMA CHAIN; 
T3G; CD3G 




CD59 


GDB: 119769 


CD59 ANTIGEN PI 8-20; CD59 HUMAN 
LEUKOCYTE ANTIGEN MICH; MICH 


15 


CDKN1C 


GDB:593296 


CYCLIN-DEPENDENT KINASE INHIBITOR 
1C; CDKN1C 


CLN2 


GDB: 125228 


LATE INFANTILE TYPE; CLN2 




CNTF 


GDB:125919 


CILIARY NEUROTROPHIC FACTOR; CNTF 


20 


CPT1A 


GDB:597642 


HYPOGLYCEMIA, HYPOKETOTIC, WITH 
DEFICIENCY OF CARNITINE 
PALMITOYLTRANSFERASE CARNITINE 

DAT A/fTTnVT TP AMQPPP A T T TV PR • 

CPT1A 


25 


CTSC 


GDB:642234 


KERATOSIS PALMOPLANTARIS WITH 
PERIODONTOPATHIA KERATOSIS 
PALMOPLANTARIS WITH 
PERIODONTOPATHIA AND 
ONYCHOGRYPOSIS CATHEPSIN C; CTSC 




DDB1 


GDB:595014 


DNA DAMAGE-BINDING PROTEIN; DDB1 




DDB2 


GDB:595015 


DNA DAMAGE-BINDING PROTEIN-2; DDB2 




DHCR7 


GDB:9835302 


SMITH-LEMLI-OPITZ SYNDROME 


30 


DLAT 


GDB: 11 8785 


CIRRHOSIS, PRIMARY; PBC 




DRD4 


GDB: 127782 


DOPAMINE RECEPTOR D4; DRD4 




ECB2 


GDB:9958955 


POLYCYTHEMIA, BENIGN FAMILIAL 




ED4 


GDB:9837373 


DYSPLASIA, MARGARITA TYPE 



35 
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Gene 


GDB Accession ID 


OMIM Link 


5 


EVR1 ! 


GDB.134029 


EXUDATIVE VITREORETINOPATHY, 

EAlVifTT I AT • TTVTi 

EXT2GDB:344921EXOSTOSES, MULTIPLE, 
TYPE II; EXT2 CHONDROSARCOMA 




F2 


GDB:1 19894 


COAGULATION FACTOR II; F2 




FSHB 


GDB: 119955 


FOLLICLE-STIMULATING HORMONE, ' 
BETA POLYPEPTIDE; FSHB 


10 


FTH1 


GDB:120617 


FERRITIN HEAVY CHAIN 1; FTH1 


GIF 


GDB:1 18800 


PERNICIOUS ANEMIA, CONGENITAL, Dub 
TO DEFECT OF INTRINSIC FACTOR 




GSD1B 


GDB:9837619 j 


GLYCOGEN STORAGE DISEASE lb 




GSD1C 


GDB:9837637 


STORAGE DISEASE Ic 


15 


HBB 


GDB: 119297 


HEMOGLOBIN-BETA LOCUS; HBB 




HBBP1 


GDB:120035 


HEMOGLOBIN— BETA LOCUS; HBB 




HBD 


GDB: 119298 


HEMOGLOBIN-DELTA LOCUS; HBD 




HBE1 


GDB: 119299 


HEMOGLOBIN— EPSILON LOCUS; HBE1 


20 


HBG1 


GDB:1 19300 


HEMOGLOBIN, GAMMA A; HBG1 


HBG2 


GDB: 119301 


HEMOGLOBIN, GAMMA G; HBG2 




HMBS 


GDB: 120528 


PORPHYRIA, ACUTE INTERMITTENT; AIP 




HND 


GDB:9954478 


HARTNUP DISORDER 




HOMG2 


GDB-.9956484 


MAGNESIUM WASTING, RENAL 


25 


un a c 
HKAo 




RT ADDFR CANCER V-HA-RAS HARVEY 
RAT SARCOMA VIRAL ONCOGENE 

T Trf*VJV jT/"\T C\f^ . UD A c 

HOMULUCj; HKAo 




HVBS1 


GDB: 120069 


CAN CbK, rlbr A 1 LIUcJL/L/U LAK 


30 


IDDM2 


KjUHAZoDii) 


niARPTPQ MFT T TTIJS 
ftDij A jdo jlvif ti ti /X iujj 

INSULIN-DEPENDENT, 2 DIABETES 
MELLITUS, JUVENILE-ONSET 
INSULIN-DEPENDENT; IDDM 




IGER 


GDB: 119696 


IgE RESPONSIVENESS, ATOPIC; IGER 




INS 


GDB:1 19349 


INSULIN; INS j 


35 


JBS 


GDB:120111 


JACOBSEN SYNDROME; JBS 
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Gene 


GDB Accession ID 


OMIM Link 


5 


KCNJU 


GDB:7009893 


POTASSIUM CHANNEL, 

IMWARDI V-RFCTTFYTNG SUBFAMILY J 

MEMBER 11; KCNJ11 PERSISTENT 

HYPERJNSULllNiiMIC/ HYPOGLYCEMIA vjr 

INFANCY 




KCNJ1 


GDB:204206 


POTASSIUM CHANNEL, 
INWARDLY-RECTIFYING, SUBFAMILY J, 
MEMBER 1; KCNJ1 




KCNQ1 


GDB:741244 


LONG QT SYNDROME, TYPE 1; LQT1 


10 


LDHA 


GDB:120141 


LACTATE DEHYDROGENASb-A; LDHA 




LRP5 


GDB:9836818 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME; OPPG HIGH BONE MAbo 


15 


MEN1 


GDB: 1201 73 


MULTIPLE ENDOCRINE NEOPLASIA, TYPE 
1;MEN1 


X JfT T 

MLL 


vjrJJli:12ooiy 


iwTVPT OTTi/T VA/TPWOTD OP MTXT*D-T TNFAGF 

1V1 i rVI A i\,t,Jl 1_* I iVLX llWJUL/ V-rX\. 1VXL/\J-/1^ X^XlNJJ/jrWWi-/ 

LEUKEMIA; MLL 




MTACR1 


GDB: 125743 


MULTIPLE TUMOR ASSOCIATED 
CHROMOSOME REGION 1; MTACR1 


20 


MYBPC3 


GDB:579615 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 4: CMH4 
MYOSIN-BINDING PROTEIN C, CARDIAC; 

ivl I DrK^D 


25 


MY07A 


GDB: 132543 


MYOSIN VHA; MY07A DEAFNESS, 

xrcT TP riQPXIQfYR V AT TTfi^OK/TAT 

RECESSIVE, 2; DFNB2 DEAFNESS, 
AUTOSOMAL DOMINANT 

■v x y^v-v i-ft^c TT\T\ /\t J-T/-1 flT^TL TO /\T*TXT'PT TO AT "11. 

NONSYNDROMIC SENSORINEURAL, 11; 
DFNA11 




NNOl 


GDB:10450513 


SIMPLE, AUTOSOMAL DOMINANT 




OPPG 


GDB:3789438 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME; OPPG 


30 


OPTB1 


GDB:9954474 


OSTEOPETROSIS, AUTOSOMAL 
RECESSIVE 




PAX6 


GDB: 11 8997 


PAIRED BOX HOMEOTIC GENE 6; PAX6 




PC 


GDB: 119472 


PYRUVATE CARBOXYLASE DEFICIENCY 


35 


PDX1 


GDB:9836634 


PYRUVATE DEHYDROGENASE COMPLEX, 
COMPONENT X 
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Gene 


GDB Accession ID 


OMEVI Link 




PGL2 


GDB:511177 


PARAGANGLIOMAS, FAMILIAL 
NONCHROMAFFIN, 2; PGL2 


5 


PGR 


GDB: 119493 


PROGESTERONE RESISTANCE 




PORC 


GDB:128610 


PORPHYRIA, CHESTER TYPE; PORC 








PARATHYROID HORMONE; PTH 




DTQ 


\JkJD , 1 1 OOJU 


6-fffiPYRUVOYLTETRAHYDROPTERIN 
SYNTHASE; PTS 


10 


P\/T?T 1 
r VKJH 


VJL/D.JOJ7J 1 


ECTODERMAL DYSPLASIA, CLEFT LIP 
AND PALATE, HAND AND FOOT 
DEFORMITY, DYSPLASIA, MARGARITA 

TVDT7 T>r\J TfWTPTTQ PRPPPTOR RFT ATFD" 

PVRR 


15 . 


PYGM 


GDB:120329 \ 


GLYCOGEN STORAGE DISEASE V 


RAG1 


GDB: 120334 


RECOMBINATION ACTIVATING GENE-1; 
RAG1 




RAG2 


GDB: 125 186 


RECOMBINATION ACTIVATING GENE-2; 
RAG2 




ROM1 


GDB:120350 


ROD OUTER SEGMENT PROTEIN- 1; KOMI 


20 


SAA1 


GDB: 120364 


SERUM AMYLOID Al ; SAA1 




SCA5 


GDB:378219 


SPINOCEREBELLAR ATAXIA 5; SCA5 




SCZD2 


GDB: 11 8874 


DISORDER-2; SCZD2 


25 


SDHD 


GDB: 132456 


DAD TAKi AC T7A\jfTTTAT 

r AKALr AJN Lj\L1UJV1 Ao, r AMLL/IAJL. 
NONCHROMAFFIN, 1; PGL1 




SERPING1 


GDB:1 19041 


ANGIONEUROTIC EDEMA, HEREDITARY; 
HANE 




SMPD1 


GDB:128144 


NIEMANN-PICK DISEASE 


30 


TCTRfsi 

1 V^XXvVJ 1 


GDB-9956269 


OSTEOPETROSIS, AUTOSOMAL 
RECESSIVE 


TCL2 


GDB:9954468 


LEUKEMIA, ACUTE T-CELL; ATL 


35 


TECTA 


GDB:6837718 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 8; 
DFNA8 DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 12; DFNA12 
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Gene 


GDB Accession ID 


OMIM Link 




TH 


GDB:1 19612 


i-n^TT* /^votxttt uVT\PnYVT ACT?* TUT 

TYROSlNh, rl Y UKUA Y LAoJfcs, 1 Jri 




TREH 


GDB.9958953 


TREHALASE 


5 


TSG101 


GDB:1313414 


TUMOR SUSCEPTIBILITY GENE 101; 
TSG101 




TYR 


GDB:120476 


ALBINISM I 




USH1C 


GDB: 132544 


USHER SYNDROME, TYPE IC; USH1C 




VMD2 


GDB:133795 


\/TTP?T T TT?OT?lV/f MAPTTT AR DYSTROPHY* 

VMD2 


10 


VRNI 


GDB: 135662 


INFLAMMATORY; VRNI 




XMT1 
W 1 1 


frFf!** 190496 


FRASIER SYNDROME WILMS TUMOR; 
WT1 


15 


WT2 


GDB: 118886 


MULTIPLE TUMOR ASSOCIATED 
CHROMOSOME REGION 1; MTACRl 




ZNF145 


GDB:230064 


PROMYELOCYTE LEUKEMIA ZINC 
FINGER; PLZF 



Table 13: Genes, Locations and Genetic Disorders on Chromosome 12 



25 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


A2M 


GDB: 119639 


ALPHA-2-MACROGLOBULIN; A2M 


AAAS 


GDB:9954498 


GLUCOCORTICOID DEFICIENCY 
AND ACHALASIA 


ACADS 


GDB: 11 8959 


ACYL-CoA DEHYDROGENASE, 
SHORT-CHAIN; ACADS 


ACLS 


GDB: 136346 


ACROCALLOS AL SYNDROME; ACLS 


ACVRL1 


GDB:230240 


OSLER-RENDU- WEBER SYNDROME 
2; ORW2 ACTTVIN A RECEPTOR, 
TYPE H-LIKE KINASE 1; ACVRL1 


ADHR 


GDB:9954488 


VITAMIN D-RESISTANT RICKETS, 
AUTOSOMAL DOMINANT 


ALDH2 


GDB:1 19668 


ALDEHYDE DEHYDROGENASE-2; 
ALDH2 


AMHR2 


GDB:696210 


ANTI-MULLERIAN HORMONE TYPE 
II RECEPTOR; AMHR2 


AOM 


GDB: 118998 


STICKLER SYNDROME, TYPE I; STL1 
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1 Gene 1 


GDB Accession ID | 1 


OMIMLink j 


AQP2 j 


GDB:141853 1 


AQUAPORIN-2; AQP2 DIABETES 
INSIPIDUS, RENAL TYPE DIABETES 
INSIPIDUS, RENAL TYPE, 
ai TTr^QOivyi AT RFCFSSTVE 1 


1 ATD j 

L 


GDB:696353 j 


ASPHYXIATING THORACIC 1 

nvOTD fADT-IV- ATD 1 


ATP2A2 


GDB:1 19717 


ATn««« rwO-i-A TR A^QPfYRTTWO 1 
A 1 r ase, KJdA^J^y I JKAiN oruxv. l UN vj, I 

SLOW-TWITCH; ATP2A2 
DARIER-WHITE DISEASE; DAR j 


1 BDC 1 


GDB:5584359 | 


BRACHYDACTYLY, TYPE C; BDC ] 


1 ClR 1 


GDB:1 19729 1 


COMPLEMENT COMPONENT-Clr, 1 
DEFICIENCY OF | 


1 CD4 | 


GDB:1 19767 j 


T-CELL ANTIGEN T4/LEU3 ; CD4 j 


CDK4 1 


GDB:204022 j 


CYCLIN-DEPENDENT KINASE 4; 1 
CDK4 


1 CNA1 J 


GDB:252119 j 


, CORNEA PLANA 1; CNA1 j 


I COL2A1 




1 STICKLER SYNDROME, TYPE I; STL1 | 
COLLAGEN, TYPE II, ALPHA- 1 
1 CHAIN* COL2A1 1 
1 ACHONDROGENESIS, TYPE 0; ACG2 


1 CYP27B1 1 


GDB:9835730 


RICKETS; PDDR 


IdPvPLA 


GDB:270336 


1 nDMT A TADI TDD A t PAT T mOT TTYSTA 1 

| N ATROPHY; DRPLA j 


1 ENUR2 


1 GDB:666422 


ENURESIS, NOCTURNAL, 2; ENUR2 j 


1 FEOM1 


1 GDB:345037 


1 FIBROSIS OF EXTRAOCULAR 1 
1 MUSCLES, CONGENITAL; FEOM 


1 FPF 


1 GDB:9848880 


1 PERIODIC FEVER, AUTOSOMAL 1 
DOMINANT 


) IGNB3 


1 GDB-.120005 


1 GUANINE NUCLEOTIDE-BINDING 1 
PROTEIN, BETA POLYPEPTEDE-3; 
GNB3 j 


1 GNS 


1 GDB: 120006 


MUCOPOLYSACCHARIDOSIS TYPE 1 

niD 


1 HAL 


| GDB: 120746 


| HISTIDINEMIA 1 
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Gene 


GDB Accession ID 


OMIM Link 




HBP1 


GDB:701889 


BRACHYDACTYLY Wllil 
HYPERTENSION 


5 


HMGIC 


GDB:362658 


HIGH MOBILITY GROUP PRO 1 E1IN 
ISOFORM I-C; HMGIC 




HMN2 


GDB:9954508 


MUSCULAR ATROPHY, ADULT 
SPINAL 




HPD 


GDB:135978 


TYROSINEMIA, TYPE III 


10 


IGF1 


GDB:120081 


INSULINLIKE GROWTH FACTOR 1; 
IGF1 




KCNA1 


GDB: 127903 


POTASSKJM VOLTAGE-GATED 
CHANNEL, SHAKER-RELATED 
SUBFAMILY, MEMBER 


15 


KERA 


GDB:252121 


CORNEA PLANA 2; CNA2 


KRAS2 


GDB: 1201 20 


V-KI-RAS2 KIRSTEN RAT SARCOMA 
2 VIRAL ONCOGENE HOMOLOG; 
KRAS2 




KRT1 


GDB:128198 


KERATIN 1;KRT1 


20 


KRT2A 


GDB:407640 


ICHTHYOSIS, BULLOUS TYPE 
KERATIN 2A; KRT2A 




KRT3 


GDB: 136276 


KERATIN 3; KRT3 




KRT4 


GDB: 120697 


KERATIN 4; KRT4 


25 


KRT5 


GDB:128110 


EPIDERMOLYSIS BULLOSA 
HERPETIFORMIS, DOWLING-MEARA 
TYPE KERATIN 5; KRT5 




KRT6A 


GDB: 1281 11 


KERATIN 6A; KRT6A 




KRT6B 


GDB:128113 


KERATIN 6B; KRT6B 
PACHYONYCHIA CONGENITA, 
JACKSON-LAWLER TYPE 


30 


KRTHB6 


GDB:702078 


MONILETHRIX KERATIN, HAIR 
BASIC (TYPE IB 6 




LDHB 


GDB: 120147 


LACTATE DEHYDROGENASE-B; 
LDHB 




LYZ 


GDB:120160 


AMYLOIDOSIS, FAMILIAL VISCERAL 
LYSOZYME; LYZ 


35 


MGCT 


GDB:9954504 


TESTICULAR TUMORS 
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Gene 


GDB Accession ID 


OMIM Link 




MPE 


GDB:120191 


MALIGNANT PROLIFERATION OF 




MVK 


GDB: 134189 


MEVALONICACIDURIA 


5 


MYL2 


GDB: 128829 


MYOSIN, LIGHT CHAIN, 
REGULATORY VENTRICULAR; 
MYL2 




NS1 


GDB:439388 


NOONAN SYNDROME 1; NS1 


10 


OAP 


GDB: 120245 


OSTEOARTHROSIS, PRECOCIOUS; 
OAP 




PAH 


GDB: 119470 | 


PHENYLKETONURIA; PKU1 




PPKB 


GDB:696352 


PALMOPLANTAR KERATODERMA, 
BOTHNIAN TYPE; PPKB 


15 


PRB3 


GDB:1 19513 


PAROTID SALIVARY 
GLYCOPROTEIN; Gl 




PXR1 


GDB:433739 


ZELLWEGER SYNDROME; ZS ; 
PEROXISOME RECEPTOR 1 ; PXR1 




RLS 


GDB: 11501392 


ACROMELALGIA, HEREDITARY 




RSN 


GDB: 139158 


RESTIN; RSN 


20 


SAS 


GDB: 128054 


SARCOMA AMPLIFIED SEQUENCE; 
SAS 




SCA2 


GDB: 128034 


SPINOCEREBELLAR ATAXIA 2; SCA2 
ATAXIN-2; ATX2 


25 


SCNN1A 


GDB:366596 


SODIUM CHANNEL, 
NONVOLTAGE-GATED, 1; SCNN1A 




SMAL 


GDB:9954506 


ODTXT A T \ AT TC/^T TT AD ATPOr^DXJV 

&F1N/VL MUoLULAK A 1 KAJr ri I , 
CONGENITAL NONPROGRESSIVE, 
OF LOWER LIMBS 




SPPM 


GDB:9954502 


SCAPULOPERONEAL MYOPATHY; 
SPM 


30 


SPSMA 


GDB-.9954510 


SCAPULOPERONEAL 
AMYOTROPHY, NEUROGENIC, NEW 
ENGLAND TYPE 




TBX3 

t 


GDB:681969 


ULNAR-MAMMARY SYNDROME; 
UMS T-BOX 3; TBX3 


35 


TBX5 


GDB:6175917 


HOLT-ORAM SYNDROME; HOS 
T-BOX 5; TBX5 
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Gene 


GDB Accession ID 


OMIM Link 


TCF1 


GDB:125297 


TRANSCRIPTION FACTOR 1, 
hLKrAlU^; IWl MAI UK11 i-UJNoiil 
DIABETES OF THE YOUNG, TYPE III; 
MODY3 


TPI1 


GDB:1 19617 


TRIOSEPHOSPHATE ISOMERASE 1; 
TPI1 


TSC3 


GDB: 127930 


SCLEROSIS-3; TSC3 


ULR 


GDB:594089 


UTERINE 


VDR 


GDB:120487 


VITAMIN D-RESISTANT RICKETS 
WITH END-ORGAN 
UNRESPONSIVENESS TO 
1 ,25-DIHYDROXYCHOLECALCIFERO 
L VITAMIN D RECEPTOR; VDR 


VWF 


GDB: 119125 


VON WILLEBRAND DISEASE; VWD 



10 



15 Table 14: Genes, Locations and Genetic Disorders on Chromosome 13 



Gene 


GDB Accession ID 


OMIM Link 


ATP7B 


GDB: 120494 


WILSON DISEASE; WND 


BRCA2 


GDB:387848 


BREAST CANCER 2, EARLY-ONSET; 
BRCA2 


BRCD1 


GDB:9954522 


BREAST CANCER, DUCTAL, 1; BRCD1 


CLN5 


GDB:230991 


CEROID-LIPOFUSCNOSIS, NEURONAL 5; 
CLN5 


CPB2 


GDB: 129546 


CARBOXYPEPTIDASE B2, PLASMA; CPB2 


ED2 


GDB:9834522 


ECTODERMAL DYSPLASIA, HIDROTIC; 
HED 


EDNRB 


GDB: 129075 


ENDOTHELIN-B RECEPTOR; EDNRB 
HIRSCHSPRUNG DISEASE-2; HSCR2 


ENUR1 


GDB:594516 


ENURESIS, NOCTURNAL, 1; ENUR1 


ERCC5 


GDB:120515 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 5; 
ERCC5 


F10 


GDB: 119890 


X, QUANTITATIVE VARIATION IN 
FACTOR X DEFICIENCY; F10 


F7 


GDB: 119897 


FACTOR Vn DEFICIENCY 



20 



25 



30 



35 
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Gene 


GDB Accession ID 


OMIM Link 


c 
D 


GJB2 


GDB: 125247 


GAP JUNCTION PROTEIN, BETA-2, 26 KD; 
GJB2 DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 1; DFNB1 
t^c AtrMTTCS AT ITO^sOlVf AT DOMINANT 

NONSYNDROMJC SENSORINEURAL, 3; 
DFNA3 


10 


GJB6 


GDB:9958357 


ECTODERMAL DYSPLASIA, HIDRO 1 1C; 
HED DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMJC 
SENSORINEURAL, 3; DFNA3 


IPF1 


GDB:448899 


INSULIN PROMOTER FACTOR 1; IPF1 




MB SI 


GDB:128365 


MOEBIUS SYNDROME; MBS 




MCOR 


GDB:9954520 


CONGENITAL 


15 


PCCA 


GDB: 119473 


GLYCINEMIA, KETOTIC, I 


RBI 


GDB: 118734 


BLADDER CANCER RETINOBLASTOMA; 
RBI 




RHOK 


GDB:371598 


RHODOPSJN KINASE; RHOK 




SCZD7 


GDB:9864734 


DISORDER-2; SCZD2 


20 


SGCG 


GDB:3763329 


MUSCULAR DYSTROPHY, LIMB GIRDLE, 
TYPE 2C; LGMD2C 




SLC10A2 


GDB:677534 


SOLUTE CARRIER FAMILY 10, MEMBER 
2; SLC10A2 


25 


SLC25A15 


GDB:120042 


HYPERORN1THINEMIA-HYPERAMMONE 
MIA-HOMOCITRULLINURIA SYNDROME 


STARP1 


GDB:635459 


STEROIDOGENIC ACUTE REGULATORY 
PROTEIN; STAR 




ZNF198 


GDB:6382650 


ZINC FINGER PROTEDSf-198; ZNF198 


30 




Genes, Locations and Genetic Disorders on Chromosome 14 


Gene 


GDB Accession ID 


OMIM Link 




ACHM1 


GDB:1 32458 


COLORBLINDNESS, TOTAL 




ARVD1 


GDB:371339 


ARRHYTHMOGENIC RIGHT VENTRICULAR 
DYSPLASIA, FAMILIAL, 1; ARVD1 


35 


CTAA1 


GDB:265299 


CATARACT, ANTERIOR POLAR 1; CTAA1 
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Gene 


GDB Accession ID 


OMIM Link 


DAD1 


GDB:407505 


DEFENDER AGAINST CELL DEATH; DAD1 


DFNB5 


GDB:636176 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 5; DFNB5 


EML1 


GDB-.6328385 


USHER SYNDROME, TYPE IA; USH1A 


GALC 


GDB: 11 9970 


KRABBE DISEASE 


GCHI 


Kivts. lis /yo 


DYoIvJInIA, rKUvJKJbaol VJi, W11H. LJ1UKNAJL 
VARIATION GTP CYCLOHYDROLASE I 
DEFICIENCY GTP CYCLOHYDROLASE I; 
GCHI 


HE1 


GDB:9957680 


MALFORMATIONS, MULTIPLE, WITH LIMB 
ABNORMALITIES AND HYPOPITUITARISM 


IBGC1 


GDB: 10450404 


CEREBRAL CALCIFICATION, 
NONARTERIOSCLEROTIC 


IGH@ 


GDB: 11 8731 


IgA CONSTANT HEAVY CHAIN 1; IGHA1 
IMMUNOGLOBULIN: D (DIVERSITY) 
REGION OF HEAVY CHAIN IgA CONSTANT 
HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: J (JOINING) LOCI OF 
HEAVY CHAIN; IGHJ IMMUNOGLOBULIN: 
HEAVY Mu CHAIN; Mul; IGHM1 
IMMUNOGLOBULIN: VARIABLE REGION 
OF HEAVY CHAINS-Hvl; IGHV IgG HEAVY 
CHAIN LOCUS; IGHG1 IMMUNOGLOBULIN 
Gm-2; IGHG2 IMMUNOGLOBULIN Gm-3; 
IGHG3 IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA 
CHAIN; IGHD IMMUNOGLOBULIN: HEAVY 
EPSILON CHAIN; IGHE 


IGHC group 


GDB:9992632 


IgA CONSTANT HEAVY CHAIN 1; IGHA1 
IgA CONSTANT HEAVY CHAIN 2; IGHA2 j 
IMMT rMOOT ORTJI TM- HFAVY Mu CHAIN- 

Mul; IGHM1 IgG HEAVY CHAIN LOCUS; 
IGHG1 IMMUNOGLOBULIN Gm-2; IGHG2 
lMMUNUOLOJaULLN um-j; ICjxitjrJ 
IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA 
CHAIN; IGHD IMMUNOGLOBULIN: HEAVY 
EPSILON CHAIN; IGHE 


IGHG1 


GDB: 120085 


IgG HEAVY CHAIN LOCUS; IGHG1 


IGHM 


GDB:120086 


IMMUNOGLOBULIN: HEAVY Mu CHAIN; 
Mul; IGHM1 
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Gene 


GDB Accession ID 


OMIM Link 




IGHR 


GDB:9954529 


G1(A1) SYNDROME 




IV 


GDB: 139274 


INVERSUS VISCERUM 


5 


LTBP2 


GDB:453890 


LATENT TRANSFORMING GROWTH 
FACTOR-BETA BINDER PROTEIN 2; LTBP2 




MCOP 


GDB:9954527 


MICROPHTHALMOS 




MJD 


GDB: 118840 


MACHADO-JOSEPH DISEASE; MJD 


10 


MNG1 


GDB:6540062 


GOITER MULTINODULAR 1 ; MNG1 




MPD1 


GDB:230271 


MYOPATHY, LATE DISTAL HEREDITARY 




MPS3C 


GDB:9954532 


MUCOPOLYSACCHARIDOSIS TYPE HIC 




MYH6 


GDB: 120214 


MYOSIN, HEAVY POLYPEPTIDE 6; MYH6 


15 


MYH/ 


vjJJo- 1ZUZ1D 


MYH7 




Nr 






20 


rAJD.rJN 1 


vjJLIrS.DO/ 1 DO 


DYSTROPHY; OPMD OCULOPHARYNGEAL 

H M~r Tfl V*1T TT A r\t rnrpn T"*T TXT" A T T r P/^CI /~\Ti JT A T 

MUSCULAR DYSTROPHY, AUTOSOMAL 
RECESSIVE POLYADENYLATE-BINDING 
PROTEIN-2; PABP2 




PSEN1 


GDB: 135682 


ALZHEIMER DISEASE, FAMILIAL, TYPE 3; 
AD3 




PYGL 


GDB: 120328 


GLYCOGEN STORAGE DISEASE VI 


25 


RPGRIP1 


GDB: 11498766 


AMAUROSIS CONGENITA OF LEBER I 


SERPINA1 


GDB: 120289 


PROTEASE INHIBITOR 1; PI 




SERPINA3 


GDB: 118955 


ALPHA- 1 -ANTICHYMOTRYPSIN; AACT 




SERPINA6 


GDB: 127865 


CORTICOSTEROID-BINDn^G GLOBULIN; 
CBG 




SLC7A7 


GDB:9863033 


rvm a nrn a h rrxTA A /MTMini A TT 

DIBASICAMINOACIDURIA II 




SPG3A 


GDB:230126 


SPASTIC PARAPLEGIA-3, AUTOSOMAL 
DOMINANT; SPG3A 


35 


SPTB 


GDB:1 19602 


ELLIPTOCYTOSIS, RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC 
SPECTRIN, BETA, ERYTHROCYTIC; SPTB 


TCL1A 


GDB:250785 


T-CELL LYMPHOMA OR LEUKEMIA 
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Gene 


GDB Accession ID 


OMEN! Link 


TCRAV17S1 


GDB:642130 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TCRAV5S1 


GDB:451966 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TGM1 


GDB:125299 


TRANSGLUTAMINASE 1; TGM1 
ICHTHYOSIS CONGENITA 


TITF1 


GDB: 132588 


THYROID TRANSCRIPTION FACTOR 1; 
TITF1 


TMIP 


GDB:9954523 


AND ULNA, DUPLICATION OF, WITH 
ABSENCE OF TIBIA AND RADIUS 


TRA@ 


GDB:120404 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TSHR 


GDB: 1253 13 


THYROTROPIN, UNRESPONSIVENESS TO ] 


USH1A 


GDB: 118885 


USHER SYNDROME, TYPE IA; USH1A j 


VP 


GDB: 120492 


PORPHYRIA VARIEGATA 



10 



15 



Table 16: 



Genes, Locations and Genetic Disorders on Chromosome 15 



Gene 


GDB Accession ID 


OMIMLink 


ACCPN j 


GDB:5457725 


CORPUS CALLOSUM, AGENESIS OF, 
WITH NEURONOPATH Y 


AH02 


GDB:9954535 


HEREDITARY OSTEODYSTROPHY-2; 
AH02 


ANCR 


GDB: 119678 


ANGELMAN SYNDROME 


B2M 


GDB: 119028 


BETA-2-MICROGLOBULIN; B2M 


BBS4 


GDB:5 11199 


BARDET-BIEDL SYNDROME, TYPE 4; 
BBS4 


BLM 


GDB:135698 


BLOOM SYNDROME; BLM 


CAPN3 


GDB:1 19751 


CALPAIN, LARGE POLYPEPTIDE L3; 
CAPN3 MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2; LGMD2 


CDAN1 


GDB:9823267 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE I 


CDAN3 


GDB:386192 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE in; CDAN3 



20 



25 



30 



35 
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Gene 


GDB Accession ID 


OMIM Link 




CLN6 


GDB:4073043 


CEROID-Ln'OFUSCINOSIS, NEURONAL 6, 
LATE INFANTILE, VARIANT; CLN6 


5 


CMH3 


GDB: 138299 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 3; CMH3 




CYP19 


GDB: 119830 


CYTOCHROME P450, SUBFAMILY XIX; 
CYP19 


10 


CYP1A1 


GDB:120604 


CYTOCHROME P450, SUBFAMILY I, 
POLYPEPTIDE 1; CYP1A1 


CYP1A2 


GDB: 118780 


CYTOCHROME P450, SUBFAMILY I, 
POLYPEPTIDE 2; CYP1 A2 




DYX1 


GDB:1391796 


DYSLEXIA, SPECIFIC, 1; DYX1 


15 


EPB42 


GDB: 1273 85 | 


HEREDITARY HEMOLYTIC PROTEIN 4.2, 
ERYTHROCYTIC; EPB42 


ETFA 


GDB:1 19121 


GLUTARICACIDURIA II A; GA OA 




EYCL3 


GDB:4590306 


EYE COLOR-3; EYCL3 




FAH 


GDB: 11 9901 


TYROSINEMIA, TYPE I 


20 


FBN1 


GDB: 1271 15 


FIBRILLIN- 1; FBN1 MARFAN 
SYNDROME; MFS 




FES 


GDB: 119906 


V-FES FELINE SARCOMA VIRAL/V-FPS 
FUJINAMI AVIAN SARCOMA VIRAL 
ONCOGENE 




HCVS 


GDB: 119306 


CORONAVIRUS 229E SUSCEPTIBILITY; 
CVS 


25 


HEXA 


GDB: 120040 


TAY-SACHS DISEASE; TSD 




IVD 


GDB: 119354 


ISOVALERICACIDEMIA; IV A 




LCS1 


GDB:1 1500552 


CHOLESTASIS-LYMPHEDEMA 
SYNDROME 


30 


LIPC 


GDB: 119366 


LIPASE, HEPATIC; LIPC 


MY05A 


GDB:2 18824 


MYOSIN VA; MY05A 




OCA2 


GDB: 136820 


ALBINISM II 




OTSC1 


GDB:9860473 


OTOSCLEROSIS 


35 


PWCR 


GDB: 120325 


PRADER-WILLI SYNDROME 
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Gene 


GDB Accession ID 


OMIM Link 


RLBP1 


GDB:127341 


RETIN ALDEH YD E-BINDIN G PROTEIN 1,; 
RLBP1 


SLC12A1 


GDB:386121 


SOLUTE CARRIER FAMILY 12, MEMBER 
1;SLC12A1 


SPG6 


GDB:5 11201 


SPASTIC PARAPLEGIA 6, AUTOSOMAL 
DOMINANT; SPG6 


TPM1 


GDB: 127875 


TROPOMYOSIN 1; TPM1 


UBE3A 


GDB:228487 


ANGELMAN SYNDROME 
UBIQUmN-PROTEIN LIGASE E3 A; 
UBE3A 


WMS 


GDB:5583902 


WEILL-MARCHESANI SYNDROME 



10 



Table 17: 



Genes, Locations and Genetic Disorders on Chromosome 16 



Gene 


GDB Accession ID 


OMIM Link 


ABCC6 


GDB:9315106 


PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL DOMINANT; PXE 
PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL RECESSIVE; PXE 


ALDOA 


GDB: 11 8993 


ALDOLASE A, FRUCTOSE-BISPHOSPHATE; 
ALDOA 


APRT 


GDB: 119003 


ADENINE 

PHOSPHORIBOSYLTRANSFERASE; APRT 


ATP2A1 


GDB:1 19716 


ATPase, Ca(2+)-TRANSPORTING, 
FAST-TWITCH 1; ATP2A1 BRODY 
MYOPATHY ! 


BBS2 


GDB:229992 


BARDET-BEEDL SYNDROME, TYPE 2; BBS2 


CARD 15 


GDB: 11026232 


SYNOVITIS, GRANULOMATOUS, WITH 
UVEITIS AND CRANIAL NEUROPATHIES 
REGIONAL ENTERITIS 


CATM 


GDB:701219 


MICROPHTHALMIA-CATARACT 


CDH1 


GDB: 120484 


CADHERIN 1; CDH1 


CETP 


GDB: 119773 


CHOLESTERYL ESTER TRANSFER 
PROTEIN, PLASMA; CETP 


CHST6 


GDB:131407 


CORNEAL DYSTROPHY, MACULAR TYPE 


CLN3 


GDB: 120593 


CEROID-LffOFUSCINOSIS, NEURONAL 3, 
JUVENILE; CLN3 



15 



20 



25 



30 



35 
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Gene 


GDB Accession ID 


OMLM Link 




CREBBP 


GDB:437159 


RUBINSTEIN SYNDROME CREB-BINDING 
PROTEIN; CREBBP 


5 


CTH 


GDB: 119086 


CYSTATHIONINURIA 




CTM 


GDB:1 19819 


CATARACT, ZONULAR 




CYBA 


GDB:125238 


GRANULOMATOUS DISEASE, CHRONIC, 
AUTOSOMAL CYTOCHROME-b-NEGATTVE j 
FORM 


10 


CYLD 


GDB:701216 


EPITHELIOMA, HEREDITARY MULTIPLE 
BENIGN CYSTIC 




DHS 


GDB:9958268 


XEROCYTOSIS, HEREDITARY 




DNASE1 


GDB: 132846 


DEOXYRIBONUCLEASE I; DNASE1 




DPEP1 


GDB:128059 


RENAL DIPEPTID ASE 


15 


ERCC4 


GDB:119113 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 4; | 
ERCC4 XERODERMA PIGMENTOSUM, 
COMPLEMENTATION GROUP F; XPF \ 


20 


FANCA 


GDB:701221 


FANCONI ANEMIA, COMPLEMENTATION 
GROUP A; FACA 


GALNS 


GDB: 129085 


MUCOPOLYSACCHARIDOSIS TYPE IV A 




GAN 


GDB:9864885 


NEUROPATHY, GIANT AXONAL; GAN 




HAGH 


GDB: 119292 


HYDROXY ACYL GLUTATHIONE 
HYDROLASE; HAGH 


25 


HBA1 


GDB: 119293 


HEMOGLOBIN-ALPHA LOCUS- 1; HBA1 




HBA2 


GDB: 119294 


HEMOGLOBIN-ALPHA LOCUS-2; HBA2 




HBHR 


GDB:9954541 


HEMOGLOBIN H-RELATED MENTAL 
RETARDATION 




HBQ1 


GDB: 120036 


HEMOGLOBIN— THET A- 1 LOCUS; HBQ1 


3U 


HBZ 


GDB: 119302 


HEMOGLOBIN— ZETA LOCUS; HBZ 




HBZP 


GDB: 120037 


HEMOGLOBIN— ZETA LOCUS; HBZ 




HP 


GDB: 1193 14 


HAPTOGLOBIN; HP 


35 


HSD11B2 


GDB:409951 


CORTISOL 1 1-BETA-KETOREDUCTASE 
DEFICIENCY 


IL4R 


GDB: 118823 


INTERLEUKIN-4 RECEPTOR; IL4R 
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Gene 


GDB Accession ID 


OMIM Link 




LIPB 


GDB:1 19365 


UPASE B, LYSOSOMAL ACID; LIPB 




MC1R 


GDB: 135 162 


MELANOCORTIN-1 RECEPTOR; MC1R 


5 


MEFV 


GDB: 125263 


MEDITERRANEAN FEVER, FAMILIAL; 
MEFV 




MHC2TA 


GDB:6268475 


MHC CLASS n TRANSACTTVATOR; 
MHC2TA 


10 


MLYCD 


GDB:1 1500940 


MALONYL CoA DECARBOXYLASE 
DEFICIENCY 




PHKB 


GDB: 120286 


PHOSPHORYLASE KINASE, BETA 
SUBUNIT; PHKB 




PHKG2 


GDB: 1403 16 ' 


PHOSPHORYLASE KINASE, TESTIS/LIVER, 
GAMMA 2; PHKG2 


15 


PKD1 


GDB: 120293 


POLYCYSTIC KIDNEYS POLYCYSTIC 
KIDNEY DISEASE 1; PKD1 




PKDTS 


GDB:9954545 


POLYCYSTIC KIDNEY DISEASE, INFANTILE 
SEVERE, WITH TUBEROUS SCLEROSIS; 


20 


PMM2 


GDB:438697 


CARBOHYDRATE-DEFICIENT 
GLYCOPROTEIN SYNDROME, TYPE I; CDCjI 
PHOSPHOMANNOMUTASE 2; PMM2 




PXE 


GDB:6053895 


PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL DOMINANT; PXE 
PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL RECESSIVE; PXE 


25 


SALL1 


GDB:4216161 


TOWNES-BROCKS SYNDROME; TBS 
SAL-LIKE 1; SALL1 




SCA4 


GDB:250364 


SPINOCEREBELLAR ATAXIA 4; SCA4 




SCNN1B 


GDB:434471 


SODBJM CHANNEL, NONVOLTAGE-GATED 
1 BETA; SCNN1B 


30 


SCNN1G 


GDB:568759 


SODKJM CHANNEL, NONVOLTAGE-GATED 
1 GAMMA; SCNN1G 




TAT 


GDB:120398 


TYROSINE TRANSAMINASE DEFICIENCY 




TSC2 


GDB: 120466 


TUBEROUS SCLEROSIS-2; TSC2 


35 


VDI 


GDB: 119629 


DEFECTIVE INTERFERING PARTICLE 
INDUCTION, CONTROL OF 


WT3 


GDB:9958957 


WILMS TUMOR, TYPE HI; WT3 
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Table 1 8: Genes, Locations and Genetic Disorders on Chromosome 17 





VJvLIb 


GDB Accession ID 


OMIM Link 


5 


ABR 


GDB T 19642 


ACTIVE BCR-RELATED GENE; ABR 




ACACA 


GDB:120534 


ACETYL-CoA CARBOXYLASE 
DEFICIENCY 




ACADVL 


GDBT248185 


ACYL-CoA DEHYDROGENASE, 
VERY-LONG-CHAIN, DEFICIENCY OF 


10 


ACE 


GDB- 119840 


DIPEPTIDYL CARBOXYPEPTIDASE-1; 
DCP1 




ALDH3A2 


GDB 1316855 


SJOGREN-LARSSON SYNDROME; SLS 




APOH 


GDB- 118887 


APOLIPOPROTEIN H; APOH 


15 


ASPA 


GDB'231014 


SPONGY DEGENERATION OF CENTRAL 
NERVOUS SYSTEM 




AXIN2 


GDB '9864782 ' 


CANCER OF COLON 




BCL5 


GDB:125178 


LEUKEMIA/LYMPHOMA, CHRONIC 
B-CELL, 5; BCL5 


20 


BHD 


GDB: 11498904 


WITH TRICHODISCOMAS AND 
ACROCHORDONS 




BLMH 


GDB:3801467 


BLEOMYCIN HYDROLASE 




BRCA1 


GDB:1266U 


BREAST CANCER, TYPE 1; BRCA1 




CACD 


GDB:5885801 


CHOROIDAL DYSTROPHY, CENTRAL 
AREOLAR; CACD 


25 


CCA1 


GDB:1 18763 


CATARACT, CONGENITAL, CERULEAN 
TYPE 1; CCA1 




CCZS 


GDB:681973 


CATARACT, CONGENITAL ZONULAR, 
WITH SUTURAL OPACITIES; CCZS 


30 


CHRNB1 


GDB: 120587 


CHOLINERGIC RECEPTOR NICOTINIC, 
BETA POLYPEPTIDE 1; CHRNB1 




CHRNE 


GDB: 132246 


CHOLINERGIC RECEPTOR, NICOTINIC, 
EPSILON POLYPEPTIDE; CHRNE 


35 


CMT1A 


GDB: 11 9785 


CHARCOT-MARIE-TOOTH DISEASE, 
TYPE 1A; CMT1 A NEUROPATHY, 
HEREDITARY, WITH LIABILITY TO 
PRESSURE PALSIES; HNPP 
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Gene 


GDB Accession ID 


OMEM Link 


5 


COL1A1 


GDB:1 19061 


COLLAGEN, TYPE I, ALPHA- 1 CHAIN; 
COL1A1 OSItlOGbNESIb IMrbRrcLlA 
TYPE I OSTEOGENESIS IMPERFECTA 
TYPE IV; OI4 




CORD5 


GDB:568473 


CONE-ROD DYSTROPHY-5; CORD5 




CTNS 


GDB:700761 


CYSTINOSIS, EARLY-ONSET OR 
INFANTILE NEPHROPATHY TYPE ! 


10 


EPX 


GDB:377700 


EOSINOPHIL PEROXIDASE; EPX 


ERBB2 


GDB:120613 


V-ERB-B2 AVIAN ERYTHROBLASTIC 
LEUKEMIA VIRAL ONCOGENE 
HOMOLOG 2; ERBB2 




G6PC 


GDB:231927 


GLYCOGEN STORAGE DISEASE I; GSD-I 


15 


GAA 


GDB:1 19965 


GLYCOGEN STORAGE DISEASE II 


GALK1 


GDB: 119246 


GALACTOKINASE DEFICIENCY 




GCGR 


GDB-.304516 


GLUCAGON RECEPTOR; GCGR 




GFAP 


GDB: 118799 


GLIAL FIBRILLARY ACIDIC PROTEIN; 
GFAP ALEXANDER DISEASE 


20 


GH1 


GDB: 119982 


GROWTH HORMONE 1; GH1 




GH2 


GDB: 119983 


GROWTH HORMONE 2; GH2 




GP1BA 


GDB:1 18806 


GIANT PLATELET SYNDROME 




GPSC 


GDB:9954564 


FAMILIAL PROGRESSIVE SUBCORTICAL 


25 


GUCY2D 


GDB:1 36012 


AMAUROSIS CONGENITA OF LEBER I 
GUANYLATE CYCLASE 2D, MEMBRANE; 
GUC2D CONE-ROD DYSTROPHY-6; 
CORD6 




ITGA2B 


GDB: 120012 


THROMBASTHENIA OF GLANZMANN 
AND NAEGELI 


30 


ITGB3 


GDB: 120013 


INTEGRIN, BETA-3 ; ITGB3 


ITGB4 


GDB:128028 


INTEGRIN, BETA-4; ITGB4 




KRT10 


GDB:1 18828 


KERATIN 10;KRT10 


35 


KRT12 


GDB:5583953 


CORNEAL DYSTROPHY, JUVENILE 
EPITHELIAL, OF MEESMANN KERATIN 
12; KRT12 



- 120- 



NY2- 1321355.1 





Gene 


GDB Accession ID 


OMIM Link 




KRT13 


GDB:120740 


KERATIN 13; KRT13 


5 


KRT14 


GDB:132145 


KERATIN 14; KRT14 GLUTATHIONE 
SYNTHETASE; GSS 




KRT14L1 


GDB:120121 


KERATIN 14; KRT14 




KRT14L2 


GDB:1 20122 


KERATIN 14; KRT14 




KRT14L3 


GDB:120123 


KERATIN 14; KRT14 


10 


KRT16 


GDB: 136207 


KERATIN 16; KRT16 




KRT16L1 


GDB: 120 125 


KERATIN 16; KRT16 




KRT16L2 


GDB:120126 


KERATIN 16; KRT1 6 




KRT17 


GDB: 1362 11 


KERATIN 17; KRT17 PACHYONYCHIA 


15 


KRT9 


GDB:303970 


EPIDERMOLYTIC 


20 


MAPT 


GDB: 119434 


MICROTUBULE-ASSOCIATED PROTEIN 
TAU; MAPT PALLIDOPONTONIGRAL 
DbCjhNfcKAIlUJN, .rrlND 
DISINHIBITION-DEMENTIA-PARKINSONI 
SM-AMYOTROPHY COMPLEX; DDPAC 




MDB 


GDB:9958959 


MEDULLOBLASTOMA; MDB 




MDCR 


GDB: 120525 


SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNIT 


25 


MGI 


GDB:9954550 


MYASTHENIA GRAVIS, FAMILIAL 
INFANTILE; FIMG 




MHS2 


GDB:132580 


MALIGNANT HYPERTHERMIA 
SUSCEPTIBILITY-2; MHS2 




MKS1 


GDB:681967 


MECKEL SYNDROME; MKS 


30 


MPO 


GDB:120192 


MYELOPEROXIDASE DEFICIENCY 




MUL 


GDB:636050 


MULIBREY NANISM; MUL 




MY015A 


GDB:9838006 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 3; DFNB3 


35 


NAGLU 


GDB:636533 


MUCOPOLYSACCHARIDOSIS TYPE UIB 
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Gene 


GDB Accession ID 


OMIM Link 




NAPB 


GDB:9954572 


NEURITIS WITH BRACHIAL 
PREDILECTION; NAPB 


5 


NF1 


GDB:l2023l 


NEUROFIBROMATOSIS, TYPE I; NF1 




NME1 


GDB: 127965 


IN \JJN-JVLt2f 1 1 A 1 VsEfL^Uo 1 , rl\\J 1 E»JLTN 

EXPRESSED IN; NME1 




P4HB 


GDB: 120708 


T)T3 r\T VT A UVnDnYVT A QL7 A 

rKUL i L»-4-ri x JJJKAJ A Y JL/Voii, Die, I J\ 
POLYPEPTIDE; PHDB; PROHB 


10 


PAFAH1B1 


GDB:677430 


\yrrr t rrr> rurrT^Pl? T TQQT7>jr i P?PT4AT V 

SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNIT 




PECAM1 


GDB:696372 


PLATELET-ENDOTHELIAL CELL 
ADHESION MOLECULE; PECAM1 


15 


PEX12 


GDB:6155804 


Zihl/LfL, W l^OC/iv o liNIJxvlJiVlxi, Zvo 
PEROXIN-12; PEX12 




PHB 


GDB: 126600 


PROHIBITIN; PHB 


zU 


PMP22 


GDB: 134190 


CHARCOT-MARIE-TOOTH DISEASE, 
TYPE 1A- CMT1 A HYPERTROPHIC 
NEUROPATHY OF DEJERINE-SOTTAS 

T>T7r> mtrCD A T A vfVCT TXT DD /^fCTKT OO • 

PMP22 


25 


PRKARIA 


GDB: 120313 


MiaUMA, arUl 1 Y rHjJVLEJN lAlUJiN, 
AND ENDOCRINE OVERACTIVITY 
PROTEIN KINASE, cAMP-DEPENDENT, 
REGULATORY, TYPE L ALPHA; 
PRKARIA 


PRKCA 


GDB: 128015 


PROTEIN KINASE C, ALPHA; PRKCA 




PRKWNK4 


GDB:9954566 


PSEUDOHYPOALDOSTERONISM TYPE II, 
LOCUS B; PHA2B 




PRP8 


GDB:9957697 


RETINITIS PIGMENTOSA- 13; RP13 


30 


PRPF8 


GDB:392647 


RETINITIS PIGMENTOSA-13; RP13 




PTLAH 


GDB:9957342 


APLASIA OR HYPOPLASIA 




RARA 


GDB: 120337 


RETINOIC ACID RECEPTOR, ALPHA; 
RARA 




RCVl 


GDB: 135477 


RECOVERED; RCVl 
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Gene 


GDB Accession ID 


OM1M Link 




RMSA1 


GDB.-304519 


REGULATOR OF MITOTIC SPINDLE 
ASSEMBLY 1; RMSA1 


5 


RP17 


GDB-683199 


RETINITIS PIGMENTOSA-17; RP17 




RSS 


GDB:439249 


RUSSELL-SILVER SYNDROME; RSS 




SCN4A 


GDB:125181 


PERIODIC PARALYSIS II 




SERPINF2 


GDB:120301 


PLASMIN INHIBITOR DEFICIENCY 


10 


SGCA 


GDB:3 84077 


ADHALIN; ADL 




SGSH 


GDB:1319101 


MUCOPOLYSACCHARIDOSIS TYPE IDA 




SHBG 


GDB: 125280 


SEX HORMONE BINDING GLOBULIN; 
SHBG ! 


15 


SLC2A4 


GDB: 11 9997 


SOLUTE CARRIER FAMILY 2, MEMBER 4; 
SLC2A4 


20 


SLC4A1 


GDB: 11 9874 


SOLUTE CARRIER FAMILY 4, ANION 
EXCHANGER, MEMBER 1; SLC4A1 
BLOOD GROUP-DIEGO SYSTEM; DI 
BLOOD GROUP-WRIGHT ANTIGEN; Wr 
ELLIPTOCYTOSIS, RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC 


SLC6A4 


GDB:134713 


SOLUTE CARRIER FAMILY 6, MEMBER 4; 
SLC6A4 




SMCR 


GDB: 120379 


SMITH-MAGENIS SYNDROME; SMS 




SOST 


GDB: 10450629 


SCLEROSTEOSIS 


ZD 


SOX9 


GDB:134730 


DYSPLASIA 




SSTR2 


GDB: 1341 86 


SOMATOSTATIN RECEPTOR-2; SSTR2 




SYM1 


GDB:512174 


SYMPHALANGISM, PROXIMAL; SYM1 




SYNS1 


GDB:9862343 


SYNOSTOSES, MULTIPLE, WITH 
BRACHYDACTYLY 


30 


TCF2 


GDB: 125298 


TRANSCRIPTION FACTOR-2, HEPATIC; 
TCF2 




THRA 


GDB: 120730 


THYROID HORMONE RECEPTOR, ALPHA 
1; THRA 


35 


TIMP2 


GDB:1 32612 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-2; TIMP2 
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Gene 


GDB Accession ID 


OMIMLink 


TOC 


GDB:451978 


TYLOSIS WITH ESOPHAGEAL CANCER; 
TOC 


TOP2A 


GDB:1 18884 


TOPOISOMERASE (DNA) II, ALPHA; 
TOP2A 


TP53 


GDB:120445 


CANCER, HEPATOCELLULAR 
LI-FRAUMENI SYNDROME; LFS TUMOR 
PROTEIN p53; TP53 CARCINOMA 


VBCH 


GDB:9954554 


HYPEROSTOSIS CORTICALIS 
GENERALISATA 



Table 19: Genes, Locations and Genetic Disorders on Chromosome 18 





Gene 


GDB Accession ID 


OMIM Link 


15 


ATP8B1 


GDB:453352 


CHOLESTASIS, PROGRESSIVE FAMILIAL 
INTRAHEPATIC l; PFICl INTRAHEPA1IC 
CHOLESTASIS FAMILIAL INTRAHEPATIC 
CHOLESTASIS-l ; FIC1 




BCL2 


GDB:1 19031 


B-CELL CLL/LYMPHOMA 2; BCL2 




CNSN 


GDB:9954580 


C ARNO SINEMIA 




CORD1 


GDB:1 18773 


CONE-ROD DYSTROPHY- 1 ; CORD1 


20 


CYB5 


GDB:125236 


METHEMOGLOBINEMIA DUE TO j 
DEFICIENCY OF CYTOCHROME b5 




DCC 


GDB: 119838 


DELETED IN COLORECTAL CARCINOMA; 
DCC 


25 


F5F8D 


GDB:6919858 


FACTOR V AND FACTOR VDI, COMBINED 
DEFICIENCY OF; F5F8D 


FECH 


GDB:127282 


PROTOPORPHYRIA, ERYTHROPOIETIC 




FEO 


GDB:4378120 


POLYOSTOTIC OSTEOLYTIC DYSPLASIA, 
HEREDITARY EXPANSILE; HEPOD 




LAMA3 


GDB:251818 


LAMININ, ALPHA 3; LAMA3 


30 


LCFS2 


GDB:9954578 


CANCER 


MADH4 


GDB:4642788 


POLYPOSIS, JUVENILE INTESTINAL 
MOTHERS AGAINST DECAPENTAPLEGIC, 
DROSOPHILA, HOMOLOG OF, 4; MADH4 




MAFD1 


GDB:120163 


MANIC-DEPRESSIVE PSYCHOSIS, 
AUTOSOMAL 


35 


MC2R 


GDB:135163 


ADRENAL UNRESPONSIVENESS TO ACTH 
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Gene 


GDB Accession ID 


OMIM Link 


MCL 


GDB:9954574 


LEIOMYOMA I A, rtbKbUI 1 AKY MUJLllrJLfc,, 
OF SKIN 


MYP2 


GDB:9862232 


MYOPIA 


NPC1 


GDB:1 38178 


NIEMANN-PICK DISEASE, TYPE CI ; NPC1 


SPPK 


GDB:606444 


PALMOPLANTARIS STRIATA 


TGFBRE 


GDB:250852 


TRANSFORMING GROWTH FACTOR, BETA 
1 RESPONSE ELEMENT 


TGIF 


GDB:9787150 


HOLOPROSENCEPHALY, TYPE 4; HPE4 


TTR 


GDB: 119471 


TRANSTHYRETIN; TTR 



Table 20 : Genes, Locations and Genetic Disorders on Chromosome 1 9 



Gene 


GDB Accession ID 


OMIM Link 


AD2 


GDB:1 18748 


ALZHEIMER DISEASE-2; AD2 


AMH 


GDB: 118996 


PERSISTENT MULLERIAN DUCT 
SYNDROME, TYPES I AND II; PMDS 
AMTT-MT TT T FRTAN HORMONE* AMH 


APOC2 


GDB:1 19689 


APOLIPOPROTEIN C-H DEFICIENCY, 
TYPE I HYPERLIPOPROTEINEMIA DUE 
TO 


APOE 


GDB:1 19691 


APOLIPOPROTEIN E; APOE 


ATHS 


GDB:128803 


LIPOPROTEIN PHENOTYPE; ALP 


BAX 


GDB:228082 


BCL2-ASSOCIATED X PROTEIN; BAX 


BCKDHA 


GDB:1 19723 


MAPLE SYRUP URINE DISEASE 


BCL3 


GDB:120561 


B-CELL LEUKEMLVLYMPHOMA-3 ; 
BCL3 


BFIC 


GDB:9954584 


BENIGN FAMILIAL INFANTILE 
CONVULSIONS 


C3 


GDB:1 19044 


COMPLEMENT COMPONENT-3; C3 


CACNA1A 


GDB: 126432 


ATAXIA, PERIODIC 

VESTIBULOCEREBELLAR HEMIPLEGIC 
MIGRAINE, FAMILIAL; MHP 
SPINOCEREBELLAR ATAXIA 6; SCA6 
CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, P/Q TYPE, 
ALPHA 1 A SUBUNIT; CACNA1A 
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Gene 


GDB Accession ID 


OMIM Link 




CCO 


GDB: 119755 


CENTRAL CORE DISEASE OF MUSCLE 




CEACAM5 


GDB:1 19054 


CARCINOEMBRYONIC ANTIGEN; CEA 


5 


COMP 


GDB:344263 


EPIPHYSEAL DYSPLASIA, MULTIPLE; 
MED PSEUDOACHONDROPLASTIC 
DYSPLASIA CAKIILACjii ULlLrUMbRlU 
MATRIX PROTEIN; COMP 


10 


CRX 


GDB:333932 


CONE-ROD DYSTROPHY-2; CORD2 
AMAUROSIS CONGENITA OF LEBER I 
CONE-ROD HOMEO BOX-CONTAINING 
GENE 




DBA 


GDB:9600353 


ANEMIA, CONGENITAL HYPOPLASTIC, 
OF BLACKF AN AND DIAMOND 




DDU 


GDB:10796026 


URTICARIA; DDU 


15 


DFNA4 


GDB:606540 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 4; 
DFNA4 




DLL3 


GDB:9959026 


VERTEBRAL ANOMALIES 




DMPK 


GDB: 119097 


DYSTROPHIA MYOTONICA; DM 


20 


DMWD 


GDB:7178354 


DYSTROPHIA MYOTONICA; DM 




DPD1 


GDB:10796170 


ENGELMANN DISEASE 




E11S 


GDB:1 19101 


ECHO 1 1 SENSITIVITY; El IS 




ELA2 


GDB:1 18792 


ELASTASE-2; ELA2 NEUTROPENIA, 
CYCLIC 


25 


EPOR 


GDB: 125242 


ERYTHROPOIETIN RECEPTOR; EPOR 




ERCC2 


GDB:1 19112 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 2; 
ERCC2 XERODERMA PIGMENTOSUM 
IV;XP4 


30 


ETFB 


GDB: 119887 


ELECTRON TRANSFER FLAVOPROTEIN, 
BETA POLYPEPTIDE; ETFB 




EXT3 


GDB:383780 


EXOSTOSES, MULTIPLE, TYPE B3; EXT3 




EYCL1 


GDB: 119269 


EYE COLOR-1; EYCL1 




FTL 


GDB: 119234 


FERRITIN LIGHT CHAIN; FTL 


35 


FUT1 


GDB: 12061 8 


FUCOSYLTRANSFERASE-1 ; FUT1 
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Gene 


GDB Accession ID 


OMLMLink 




FUT2 


GDB:120619 


FUCOSYLTRANSFERASE-2; FUT2 




FUT6 


GDB:135180 


FUCOSYLTRANSFERASE-6- FUT6 


5 


GAMT 


GDB:1313736 


GU ANIDINO ACETATE 
METHYLTRANSFERASE; GAMT 




GCDH 


GDB: 136004 


GLUTARICACDDEMIA I 




GPI 


GDB: 1200 15 


GLUCOSEPHOSPHATE ISOMFRASF- OPT 


10 


GUSM 


GDB: 119291 


GLUCURONIDASE, MOUSE, MODIFIER 
OF- GUSM 




HB1 


GDB:9954586 


BUNDLE BRANCH BLOCK 




HCL1 


GDB: 119304 


HAIRCOLOR-l;HCLl 


ID 


HHC2 


GDB:249836 


HYPOCALCIURIC HYPERCALCEMIA, 
FAMILIAL TYPE II- HHC2 




HHC3 


GDB:9955121 


HYPOCALCIURIC HYPFRCAI CFMTA 
FAMILIAL, TYPE IE; HHC3 




ICAM3 


GDB: 136236 


INTERCELLULAR ADHESION 
MOLECULE-3; ICAM3 


20 


INSR 


GDB* 119352 


INSULIN RECEPTOR- INSR 

ll^OUXiJiN XVX_fV-/X_>X lUIV, JUNOIV 




JAK3 


GDB'376460 


JANUS KINASE ^ TAK^ 




KLK3 


GDB: 119695 


ANTIGEN, PROSTATE-SPECIFIC; APS 




LDLR 


GDB: 119362 


HYPERCHOLESTEROLEMIA, FAMILIAL; 

fhc 

X L X V> 


25 


LHB 


GDB: 119364 


LUTEINIZING HORMONE, BETA 
POLYPFPTIDF- I HR 

X W -l.^ X X 1 /i, X 1L/L>^ XJJ 




LIG1 


GDB* 127274 


T TO ASF T DMA ATP-nFPFMTiFWT- T Tfll 
xwiw^vojj# i, lvinz-v, /a 1 x-i^i^x Join i^x_*in i , jlj\Jx 




LOH19CR1 


GDB.9837482 


ANEMIA, CONGENITAL HYPOPLASTIC, 
OF BLACKFAN AND DIAMOND 


30 


LYL1 


GDB: 1201 58 


LEUKEMIA, LYMPHOID, 1; LYL1 




MAN2B1 


GDB: 119376 


MANNOSIDOSIS, ALPHA B, 
LYSOSOMAL 




MCOLN1 


GDB:10013974 


MUCOLIPIDOSIS IV 


35 


MDRV 


GDB:6306714 


MUSCULAR DYSTROPHY, AUTOSOMAL 
DOMINANT, WITH RIMMED VACUOLES; 
MDRV 
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10 



25 



35 



Gene 


GDB Accession ID 


OMIM Link 


MLLT1 


GDB: 136791 


MYELOID/LYMPHOID OR MIXED 
LINEAGE LEUKEMIA, TRANSLOCATED 
TO, 1; MLLT1 


NOTCH3 


GDB:361163 


DEMENTIA, HEREDITARY 
MULTI-INFARCT TYPE NOTCH, 
DROSOPHILA, HOMOLOG OF, 3; 
NOTCH3 

X X V/ X \-/X X«y 


NPHS1 


GDB342105 


NEPHROSTS 1 CONGENITAL FINNISH 
TYPE; NPHS1 


OFC3 


GDB: 128060 


OROFACIAL CLEFT-3; OFC3 


OPA3 


GDB:9954590 


OPTIC ATROPHY INFANTILE WITH 

A *-V> XA. X 1\V/X X X X y XI ~ X *VA ^1 X 1 1 /I,/} W X X XX 

CHOREA AND SPASTIC PARAPLEGIA 


PEPD 


GDB: 120273 


PEPTIDASE D* PEPD 


PRPF31 


GDB:333911 


RETINITIS PIGMENTOSA 1 1 ; RP1 1 


PRTN3 


GDB: 126876 


PROTEINASE 3; PRTN3; PR3 


PRX 


GDB: 11501256 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS 


PSG1 


GDB: 120321 


PREGNANCY-SPECIFIC 
BETA-1 -GLYCOPROTEIN 1; PSG1 


PVR 


GDB: 120324 


POLIOVIRUS SUSCEPTIBILITY, OR 
SENSITIVITY; PVS 


RYR1 


GDB: 120359 


CENTRAL CORE DISEASE OF MUSCLE 
HYPERTHERMIA OF ANESTHESIA 
RYANODINE RECEPTOR-1; RYR1 


SLC5A5 


GDB.5892184 


SOLUTE CARRIER FAMILY 5, MEMBER 
5; SLC5A5 


SLC7A9 


GDB:9958852 


C YSTINUPJA, TYPE III; CSNU3 


STK11 


GDB:9732383 


PEUTZ-JEGHERS SYNDROME 
SERINE/THREONINE PROTEIN KINASE 
11;STK11 


TBXA2R 


GDB: 1275 17 


THROMBOXANE A2 RECEPTOR, 
PLATELET; TBXA2R 


TGFB1 


GDB: 120729 


ENGELMANN DISEASE 
TRANSFORMING GROWTH FACTOR, 
BETA-1; TGFB1 


TNNI3 


GDB: 125309 


TROPONIN I, CARDIAC; TNNI3 
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Gene 


GDB Accession ID 


OMIM Link 


TYROBP 


GDB:9954457 


POLYCYSTIC LIPOMEMBRANOUS 
OSTEODYSPLASIA WITH SCLEROSING 
LEUKOENCEPHALOPATHY 



5 Table 2 1 : Genes, Locations and Genetic Disorders on Chromosome 20 



10 



20 



25 



30 



Gene 


GDB Accession ID 


OMIM Link 


ADA 


GDB- 11 9649 


ADFNOSTNF OFAMTNASF* A1")A 


AHCY 


GDB: 11 8983 


S-ADENOSYLHOMOCYSTEINE 
HYDROLASE; AHCY 


AVP 


GDB: 119009 


DIABETES INSIPIDUS, 
NEUROHYPOPHYSEAL TYPE ARGIN1NE 
VASOPRESSIN- AVP 


CDAN2 


GDB:9823270 


DYSERYTHROPOIETIC ANEMIA 
CONOFN1TAT TYPF TI 


CDMP1 


GDB:438940 


CHONDRODYSPLASIA, GREBE TYPE 
PROTEIN 1 


CHED1 


GDB:3837719 


CORNEAL DYSTROPHY, CONGENITAL 
ENDOTHELIAL; CHED 


CHRNA4 


GDB: 128 169 


CHOLINERGIC RECEPTOR NFUROWAT 
NICOTINIC, ALPHA POLYPEPTIDE 4; 
CHRNA4 EPILEPSY, BENIGN NEONATAL; 
EBN1 


CST3 


GDB:1 19817 


AMYLOIDOSIS VI 


EDN3 


GDB:1 19862 


ENDOTHELIN-3; EDN3 
WAARDENBURG-SHAH SYNDROME 


EEGV1 


GDB:127525 


ELECTROENCEPHALOGRAM, 
LOW-VOLTAGE 


FTLL1 


GDB: 119235 


FERRITIN LIGHT CHAIN; FTL 


GNAS 


GDB: 120628 


GUANINE NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-STIMULATING 
POLYPEPTIDE; 



35 
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GDB Accession ID 


OMIM Link 


5 


GSS 


GDB:637022 


GLUTATHIONE SYNTHETASE 
DEFICIENCY OF ERYTHROCYTES, 
HEMOLYTIC ANEMIA 
PYROGLUTAMICACIDURIA 
HNF4AGDB:393281DIABETES MELLITUS, 
AUTOSOMAL DOMINANT 
TR AXI9PRTPTTON FACTOR 14. HEPATIC 
NUCLEAR FACTOR; TCF14 


10 


JAG1 


GDB:6175920 


CHOLESTASIS WITH PERIPHERAL 




KCNQ2 


GDB:9787229 


EPILEPSY, BENIGN NEONATAL; EBN1 

PO 1 AbolUM UrLAJNINJtSL, 

VOLTAGE-GATED, SUBFAMILY Q, 
MEMBER 2 




MKKS 


GDB:9860197 


HYDROMETROCOLPOS SYNDROME 


15 


NBIA1 


GDB:4252819 


HALLERVORDEN-SPATZ DISEASE 




PCK1 


GDB: 125349 


PHUoFrlUliN UJLr Y KU V A 1 c* 
CARBOXYKINASE 1, SOLUBLE; PCK1 




PI3 


GDB:203940 


nnATTTXTACn TXTT-TT'DT r TY"YD *2 • "DT^ 

PROTEINASE INHIBIIOK 5, r!3 


20 


PPGB 


GDB: 119507 


NEURAMINIDASE, DiirlULbJsU i Wllti 
BETA-GALACTOSIDASE DEFICIENCY 




PPMD 


GDB:702144 


CORNEAL DYSTROPHY, HEREDITARY 
POLYMORPHOUS POSTERIOR; PPCD 




PRNP 


GDB: 120720 . 


GERSTMANN-STRAUSSLER DISEASE; 
GSD PRION PROTEIN; PRNP 


25 


THBD 


GDB:1 19613 


THROMBOMODULIN; THBD 




TOPI 


GDB:120444 


TOPOISOMERASE (DNA) I; TOPI 



Table 22: Genes, Locations and Genetic Disorders on Chromosome 21 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


AIRE 


GDB:567198 


AUTOIMMUNE 

POLYENDOCRINOPATHY-CANDIDIA 
SIS-ECTODERMAL DYSTROPHY; 
APECED 


APP 


GDB: 119692 


ALZHEIMER DISEASE; AD AMYLOID 
BETA A4 PRECURSOR PROTEIN; APP 
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Gene 


GDB Accession ID 


OMIM Link 




CBS 


GDB:1 19754 


HOMOCYSTINURIA 


5 


COL6A1 


GDB: 119065 


COLLAGEN, TYPE VI, ALPHA- 1 ! 
CHAIN; COL6A1 MYOPATHY, 
BENIGN CONGENITAL, WITH 




COL6A2 


GDB:1 iv/yj 


t Ap r pvf TYPF VT ALPHA-2 
CHAIN; COL6A2 MYOPATHY, 
BENIGN CONCiliNllAL, Wlln 
CONTRACTURES 


10 


CSTB 


GDB:52 15249 


MYOCLONUS EPILEPSY OF 
UNVERRICHT AND LUNDBORG 
CYSTATIN B; CSTB 




DCR 


GDB: 125354 


TRISOMY 21 




DSCR1 


GDB:731000 


TRISOMY 21 


FPDMM 


GDB:9954610 


CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNIT 2; CBFA2 
PT A TFT FT nT^PVRDFR FAMILIAL 
WITH ASSOCIATED MYELOID 
MALIGNANCY 


20 


HLCS 


GDB:392648 


MULTIPLE CARBOXYLASE 
DEFICIENCY, BIOTIN-RESPONSIVE; 
MCD 




HPE1 


GDB: 136065 


T */\r /->t\t> /-vrtT^vT/^'PTIT TAT "\T T? A X ATT TAT 

HOLOPROSENCEPHALY, JbAMlLLAL 
ALOBAR 




ITGB2 


GDB: 120574 


INTEGRIN BETA-2; ITGB2 


25 


KCNE1 


GDB: 127909 


POTASSIUM VOLTAGE-GATED 
CHANNEL, ISK-RELATED 
SUBFAMILY, MEMBER 1; 




KNO 


GDB:4073044 


KNOBLOCH SYNDROME; KNO 




PRSS7 


GDB:384083 


ENTEROKINASE DEFICIENCY 


30 


RUNX1 


GDB: 1283 13 


CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNIT 2; CBFA2 
PLATELET DISORDER, FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 



35 
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Gene 


GDB Accession ID 


OMIM Link 


SOD1 


GDB: 119596 


AMYOTROPHIC LATERAL 
SCLEROSIS SUPEROXIDE 
DISMUTASE-l;SODl MUSCULAR 
ATROPHY, PROGRESSIVE, WITH 
AMYOTROPHIC LATERAL 
SCLEROSIS 


TAM 


008:9958709, 


MYELOPROLIFERATIVE 
SYNDROME, TRANSIENT 



Table 23: Genes, Locations and Genetic Disorders on Chromosome 22 
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20 



25 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


ADSL 


GDB: 119655 


ADENYLOSUCCINATE LYASE; ADSL 


ARSA 


GDB:1 19007 


METACHROMATIC 

LEUKODYSTROPHY, LATE-INFANTILE 


BCR 


GDB: 120562 


BREAKPOINT CLUSTER REGION; BCR 


CECR 


GDB: 119772 


CAT EYE SYNDROME; CES 


CHEK2 


GDB:9958730 


LI-FRAUMENI SYNDROME; LFS 
OSTEOGENIC SARCOMA 


COMT 


GDB: 11 9795 


CATECHOL-O-METHYLTRANSFERASE 
;COMT 


CRYBB2 


GDB: 11 9075 


CRYSTALLIN, BETA B2; CRYBB2 
CATARACT, CONGENITAL, 
CERULEAN TYPE, 2; CCA2 


CSF2RB 


GDB:126838 


GRANULOCYTE-MACROPHAGE 
COLONY-STIMULATING FACTOR 
RECEPTOR, BETA SUBUNIT; 


CTHM 


GDB:439247 


HEART MALFORMATIONS; CTHM 


CYP2D6 


GDB: 132127 


CYTOCHROME P450, SUBFAMILY IID; 
CYP2D 


CYP2D@ 


GDB:1 19832 


CYTOCHROME P450, SUBFAMILY IID; 
CYP2D 


DGCR 


GDB: 119843 


DIGEORGE SYNDROME; DGS 


DIA1 


GDB: 119848 


METHEMOGLOBINEMIA DUE TO 
DEFICIENCY OF METHEMOGLOBIN 
REDUCTASE 


EWSR1 


GDB:135984 


EWING SARCOMA; EWS 


GGT1 


GDB: 120623 


GLUTATHIONURIA 
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25 



Gene 


GDB Accession ID 


OMIM Link 


MGCR 


GDB: 120180 


MENINGIOMA; MGM 


MN1 


GDB580528 


MFNTNOTOM A ■ MOM 


NAGA 


GDB: 11 9445 


ALPHA- G AL ACTO SID AS E B; GALB 


NF2 


GDB: 120232 


NEUROFIBROMATOSIS, TYPE II; NF2 


0GS2 


GDB:9954619 


HYPERTELORISM WITH ESOPHAGEAL 
ABNORMALITY AND HYPOSPADIAS 


PDGFB 


GDB: 120709 


V-SIS PLATELET-DERTVED GROWTH 
FACTOR BETA POLYPEPTIDE; PDGFB 


PPARA 


GDB:202877 


PEROXISOME PROLIFERATOR 
ACTTVATFn rfppptph? at pt-t a • 

PPARA 


PRODH 


GDB5215168 


HYPFRPROT nsTPMTA TYPP T 


SC02 


GDB*9958568 


CYTOCHROME r OVTHA^F 
DEFICIENCY 


SCZD4 


GDB: 1387047 


SCHIZOPHRENIA DISORDER-4; SCZD4 


SERPIND1 


GDB- 120038 


RFPARTM COFACTOP TT* WPFO 


SLC5A1 


GDB: 120375 


SOLUTE CARRIER FAMILY 5 
MEMBER 1;SLC5A1 


SOX10 


GDB:9834028 


SRY-BOX 10; SOX10 


TCN2 


GDB: 119608 


TRANSCOBALAMIN II DEFICIENCY 


TIMP3 


GDB: 1381 75 


TISSUE INHIBITOR OF 
METALLOPROTEIN ASE-3 ; TIMP3 


VCF 


GDB:136422 


VELOCARDIOFACIAL SYNDROME 



Table 24: Genes, Locations and Genetic Disorders on Chromosome X 



30 



35 



Gene 


GDB Accession ID 


OMIM Link 


ABCD1 


GDB: 118991 


ADRENOLEUKODYSTROPHY; ALD 


ACTL1 


GDB: 119648 


ACTIN-LIKE SEQUENCE- 1; ACTL1 


ADFN 


GDB: 118977 


ALBINISM-DEAFNESS SYNDROME; ADFN; 
ALDS 


AGMX2 


GDB: 119661 


AGAMMAGLOBULINEMIA, X-LINKED, 
TYPE 2; AGMX2; XLA2 
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Gene 


GDB Accession ID 


OMIM Link 




AHDS 


/t*\i~» - 1 1 conn 

GDB: 125899 


Tvvn3XTXAT PPTARn ATTfYW V-TrNrKFD WITH 

HYPOTONIA 


5 


AIC 


GDB: 118986 


CORPUS CALLOSUM, AGENESIS OF, WITH 
CHORIORETINAL ABNORMALITY 




AIED 


GDB:1 19663 


ALBINISM, OCULAR, TYPE 2; OA2 




AIH3 


GDB:131443 


AMfcLUOfclNllolo IMrxlKrii^l/V-J, 
HYPOPLASTIC TYPE; AIH3 


10 


ALAS2 


GDB: 119666 


ANEMIA, HYPOCHROMIC 




AMCD 1 


GDB:558428o 


ADTUDn^lDVDnQTQ A/fTTT TTPT PY 

CONGENITA, DISTAL 




AMELX 


GDB: 119675 


HYPOPLASTIC TYPE; AIH1 


15 


ANOP1 


GDB: 128454 


CLINICAL; ANOP1 




AR 


GDB: 120556 


AIS ANDROGEN RECEPTOR; AR 




ARAF1 


GDB: 119004 


t r t> A "C \iTITl>nvTT? CAT) r*f^\AA A ^£1 1 \/T/R AT 
V-KAr IVlUKJJNxl oAKA^IJlVl/V jO 1 1 V nvrtJU 

ONCOGENE HOMOLOG 1; ARAF1 


20 


ARSC2 


GDB: 119702 






ARSE 


GDB:555743 


/-^t TA\m"n nnvCDT ACTA DT TXT/^T ATA 1 

X-LINKED RECESSIVE; CDPX1 




ARTS 


GDB:9954651 


FATAL X-LINKED, WITH DEAFNESS AND 
LOSS OF VISION 


25 


ASAT 


GDB:9954649 


SIDEROBLASTIC, AND 
SPINOCEREBELLAR ATAXIA; ASAT 




ASSP5 


GDB:1 19019 


CITRULLINEMIA 




ATP7A 


GDB: 119395 


ATPase, Cu(2+)-TRANSPORTING, ALPHA 
dot VPPPTTnF- ATP7A MFNKFS 

SYNDROME 


30 


ATRX 


GDB:136052 


ALPHA-THAL AS SEMI A/MENTAL 
RETARDATION SYNDROME, X-LINKED; 
ATRX ALPHA-THALASSEMJA/MENTAL 
RETARDATION SYNDROME, 
NONDELETION TYPE 




AVPR2 


GDB:131475 


DIABETES INSIPIDUS, NEPHROGENIC 


35 


BFLS 


GDB:120566 


BORJESON SYNDROME; BORJ 



-134- 



NY2- 1321355.1 



■to: 





Gene 


GDB Accession ID 


OMIM Link 




BGN 


GDB: 119727 


BIGLYCAN; BGN 


5 


BTK 


GDB: 120542 


BRUTON AGAMMAGLOBULINEMIA 
TYROSINE KINASE; BTK 




BZX 


GDB:5205912 


BAZEX SYNDROME; BZX 




C1HR 


GDB: 119040 


TATA BOX BINDING PROTEIN 
(TBP)-ASSOCIATED FACTOR 2A; TAF2A 


10 


CACNA1F 


GDB:6053864 


NIGHTBLINDNESS, CONGENITAL 
STATIONARY, X-LINKED, TYPE 2; CSNB2 
CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, ALPHA IF 
SUB UNIT; CACNA1F 




CALB3 


GDB.T33780 


CALBINDIN 3; CALB3 


15 


CBBM 


GDB:9958963 


COLORBLINDNESS, 

BLUE-MONO-CONE-MONOCHROMATIC 
TYPE; CBBM 




CCT 


GDB: 119756 


CATARACT, CONGENITAL TOTAL, WITH 
POSTERIOR SUTURAL OPACITIES IN 
HFTFR07YGOTFS- 


20 


CDR1 


GDB: 119053 


CEREBELLAR DEGENERATION-RELATED 
AT JTOANTTGFN-1 • CT>R 1 • mR^4 




CFNS 


GDB:9579470 


CRANIOFRONTONASAL SYNDROME; 




CGF1 


GDB*6275867 


COGNITION 


25 


CHM 


GDB: 120400 


CHOROIDEREMIA; CHM 


CHR39C 


GDB: 119779 


CHOLESTEROL REPRESSIBLE PROTEIN 
39C; CHR39C 




CIDX 


GDB: 127736 


SEVERE COMBINED IMMUNODEFICIENCY 
DISEASE, X-LINKED, 2; SCIDX2 


30 


CLA2 


GDB: 119782 


CEREBELLAR ATAXIA, X-LINKED; CLA2 


CLCN5 


GDB.-270667 


CHLORIDE CHANNEL 5; CLCN5 FANCONI 
SYNDROME, RENAL, WITH 
NEPHROCALCINOSIS AND RENAL 
STONES NEPHROLITHIASIS, X-LINKED 
RECESSIVE, WITH RENAL FAILURE; XRN 


35 


CLS 


GDB: 119784 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POLYPEPTIDE 3; RPS6KA3 
COFFIN-LOWRY SYNDROME; CLS 
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VJCUV 


CT)R Accession ID 


OMTM Link 




CMTX2 


GDB- 1283 11 


CHARCOT-MARIE-TOOTH NEUROPATHY 
X-LINKED RECESSIVE, 2; CMTX2 


5 


CMTX3 


GDB:128151 


CHARCOT-MARIE-TOOTH NEUROPATHY, 
X-LINKED RECESSIVE, 3; CMTX3 




CND 


GDB:9954627 


DERMOIDS OF CORNEA; CND 




COD1 


GDB.-l 19787 


CONE DYSTROPHY, X-LINKED, 1; COD1 


10 


COD2 


GDB:6520166 


CONE DYSTROPHY, X-LINKED, 2; COD2 


COL4A5 


GDB:120596 


COLLAGEN, TYPE IV, ALPHA- 5 CHAIN; 
COL4A5 LEIOMYOMATOSIS, 
ESOPHAGEAL AND VULVAL, WITH 
NEPHROPATHY 


15 


COL4A6 


GDB:222775 


COLLAGEN, TYPE IV, ALPHA-6 CHAIN; 
COL4A6 LEIOMYOMATOSIS, 
ESOPHAGEAL AND VULVAL, WITH 
NFPHROPATHY 




CPX 


GDB:120598 


CLEFT PALATE, X-LINKED; CPX 




CVD1 


GDB:9954659 


CARDIAC VALVULAR DYSPLASIA, 
X-LINKED 


20 


CYBB 


GDB:120513 


GRANULOMATOUS DISEASE, CHRONIC; 
CGD 




DCX 


GDB.-9823272 


LISSENCEPHALY, X-LINKED 




DFN2 


GDB:1 19091 


DEAFNESS, X-LINKED 2, PERCEPTIVE 
CONGENITAL; DFN2 


25 


DFN4 


GDB:433255 


DEAFNESS, X-LINKED 4, CONGENITAL 
SENSORINEURAL; DFN4 




DFN6 


GDB:1320698 


DEAFNESS, X-LINKED, 6, PROGRESSIVE; 
DFN6 




DHOF 


GDB: 119847 


FOCAL DERMAL HYPOPLASIA; DHOF 


30 


DIAPH2 


GDB:9835484 


DIAPHANOUS, DROSOPHILA, HOMOLOG 
OF, 2 DKC 1 GDB: 1 1 9096 DYSKERATOSIS 
CONGENITA; DKC 




DMD 


GDB: 119850 


MUSCULAR DYSTROPHY, 
PSEUDOHYPERTROPHIC PROGRESSIVE, 
DUCHENNE AND BECKER 


35 


DSS 


GDB:433750 


DOSAGE-SENSITIVE SEX REVERSAL; DSS 
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Gene 


GDB Accession ID 


OMIM Link 




DYT3 


GDB:118789 


TORSION DYSTONIA-3, X-LINKED TYPE; 
DYT3 


5 


EBM 


GDB: 11 9102 


BULLOUS DYSTROPHY, HEREDITARY 
MACULAR TYPE 




EBP 


GDB: 125212 


CHONDRODYSPLASIA PUNCTATA, 
X-LINKED DOMINANT; CDPX2; CDPXD; 
CPXD 


10 


EDI 


GDB: 119859 


ECTODERMAL DYSPLASIA, ANtflDROUC; 
EDA 




ELK1 


GDB: 119867 


ELK1, MEMBER OF ETS ONCOGENE 
FAMILY; ELKl 


15 


EMD 


GDB: 119108 


MUSCULAR DYSTROPHY, TARDIVE, 
DREIFUSS-EMERY TYPE WITH 
CONTRACTURES 


EVR2 


GDB: 136068 


EXUDATIVE VITREORETINOPATHY, 
FAMILIAL, X-LINKED RECESSIVE; EVR2 




F8C 


GDB: 119124 


HEMOPHILIA A 




F9 


GDB: 119900 


HEMOPHILIA B; HEMB 


20 


FCP1 


GDB:347490 


F-CELL PRODUCTION X-T TNTCCFn- FPP"V 




FDPSL5 


GDB: 119922 


SYNTHETASE-5; FPSL5 




FGD1 


GDB:1 19131 


SYNDROME FACIOGENITAL DYSPLASIA; 
FGDY 


25 


FGS1 


GDB.9836950 


FG SYNDROME 


FMR1 


GDB: 129038 


FRAGILE SITE MENTAL RETARDATION- 1 • 
FMR1 




FMR2 


GDB: 141 566 


FRAGILE SITE, FOLIC ACID TYPE, RARE, 
FRA(X)(q28);FRAXE 


30 


G6PD 


GDB:120621 


GLUCOSE-6-PHOSPHATE 
DEHYDROGENASE; G6PD 




GABRA3 


GDB: 119968 


GAMMA-AMINOBUTYRIC ACID 
RECEPTOR, ALPHA-3; GABRA3 




GATA1 


GDB: 125373 


GATA-BINDING PROTEIN 1; GATA1 


35 


GDI1 


GDB: 1347097 


GDP DISSOCIATION INHIBITOR 1; GDI1 
MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 3; MRX3 
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Gene 


GDB Accession D> 


OMJM Link 




GDXY 


GDB:9954629 


DYSGENESIS, XY FEMALE TYPE; GDXY 


5 


GJB1 


GDB T 25246 


CHA R COT-MAR IK-TOOTH PFRONFAT 
MUSCULAR ATROPHY, X-LINKED; CMTX1 
GAP JUNCTION PROTEIN, BETA-1, 32 KD; 

riT0 1 






UUa . 1 1 / 1 


H Y r xiKOL Y CJbKULiiMIA 




GLA 


GDB: 119272 


ANGIOKERATOMA, DIFFUSE 


10 


GPC3 


GDB:3770726 


GLYPICAN-3; GPC3 SIMPSON 

T\VC K yf HD DUT A O \ZA.TP\D /ft? . O'T^'V/'O 

JJ YoMUKrHIA o YJNDKUMi}; SDYS 




GRPR 


GDB:128035 


GASTRIN-RELEASING PEPTIDE 
RECEPTOR; GRPR 




GTD 


GDB:9954635 


GONADOTROPIN DEFICIENCY; GTD 


15 


GUST 


GDB:9954655 


MENTAL RETARDATION WITH OPTTP 

X v *-X-*1 K X ryj-t 1VL< X jTVXXJL//^. 1 AV-/JLN W X X XX Vl J. 

ATROPHY, DEAFNESS, AND SEIZURES 




HMS1 


GDB:251827 


1- HMS1 

X y X XLYX& X 


jL\j 


HPRT1 


GDB:1 19317 


HYPOXANTHINE GUANINE 
PHOSPHOR TRO^YT TR AMSFFR A W 1 • 
HPRT1 


HPT 


GDB: 119322 


HYPOPARATHYROIDISM, X-LINKED; 
HYFa 




HTP9 




HYrJiKlKlCHOblS, CONGENITAL 
GENERALIZED; CGH; HCG 


25 


HTR2C 


GDB:378202 


5 -@H YDROX YTR YPTAMINE RECEPTOR 
2C; HTR2C 




HYR ! 


GDB:9954625 


REGULATOR; HYR 




IDS 


GDB* 120521 


MTTfYYPfiT V<v A POT-T A X> TPlOQTQ TVPT7 TT 




IHG1 


GDB:1 19343 


HYPOPLASIA OF WITH CwT AUrOA/fA- TRCr 

1.x x x \sx x^r\.%jxr\. v_/l y vv i x xx \JL^r\KJ K^KJxykrXy ILavJ 


30 


IL2RG 


GDB:134807 


INTERLEUKIN-2 RECEPTOR, GAMMA; 
IL2RG SEVERE COMBINED 
IMMUNODEFICIENCY DISEASE, 
X-LINKED, 2; SCIDX2 




INDX 


GDB:9954657 


IMMUNONEUROLOGIC DISORDER, 
X-LINKED 


35 


IP1 


GDB:120105 


INCONTINENTIA PIGMENTI, TYPE I; IP1 
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GDB Accession ID 


OMIM Link 




IP2 


GDB:120106 


INCONTINENTIA PIGMENTI, TYPE II; IP2 


5 


JMS 


GDB.-204055 


MENTAL RETARDATION, X-LINKED, WITH 
GROWTH RETARDATION, DEAFNESS, 
AND 




KALI 


GDB:120116 


KALLMANN SYNDROME 1; KALI 




KFSD 


GDB.128174 


KERATOSIS FOLLICULARIS SPINULOSA 
DECALVANS CUM OPHIASI; KFSD 


10 


LI CAM 


GDB: 120133 


CLASPED THUMB AND MENTAL 
RETARDATION LI CELL ADHESION 
MOLECULF- I ICAM 




LAMP2 


GDB: 125376 


LYSOSOME-ASSOCTATFD MPMRR A"MF 
PROTEIN B; LAMP2; LAMPB 


15 


MAA 


GDB: 119372 


MICROPHTHALMIA OR ANOPHTHAT MOS 
WITH ASSOCIATED ANOMALIES; MAA 




MAFD2 


GDB: 119373 


PSYCHOSIS, X-LINKED 




MAOA 


GDB: 1201 64 


MONOAMINE OXIDASE A; MAOA 




MAOB 


GDB: 119377 


MONOAMINE OXIDASE B; MAOB 


20 


MCF2 


GDB: 120 168 


MCF.2 CELL LINE DERIVED 
TRANSFORMING SEQUENCE; MCF2 




MCS 


GDB: 128370 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-4 WTTT-T rONOFNTTAT 

U X llXyJYWIVIiV/ "Tj VV X X XX V^WIN VJiZilNJL 1 JIJL0 

CONTRACTURES 


ZD 


MEAX 


GDB: 119383 


X-LINKED, WITH EXCESSIVE 
AU I OPHAOY; XMEA; MEAX 




MECP2 


GDB:3851454 


SYNDROME; RTT 




MF4 


GDB- 119386 






MGC1 


GDB: 120 179 


MEGALOCORNEA; MGC1; MGCN 


30 


MIC5 


GDB: 120526 


SURFACE ANTIGEN, X-LINKED; SAX 




MIDI 


GDB-.9772232 


OP1TZ SYNDROME 




MLLT7 


GDB:392309 


MYELOID/LYMPHOID OR MIXED-LINEAGE 
LEUKEMIA, TRANSLOCATED TO, 7; 
MLLT7 


35 


MLS 


GDB:262123 


MICROPHTHALMIA WITH LINEAR SKIN 
DEFECTS; MLS 
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Gene 


GDB Accession ID 


OMIM Link 


MRSD 


GDB: 119398 


MENTAL RETARDATION, SKELETAL 
DYSPLASIA, AND ABDUCENS PALSY; 
MRSD 


MRX14 


GDB: 138453 


RETARDATION, X-LINKED 14; MRX14 


MRX1 


GDB:120193 


MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 1; MRX1 


MRX20 


GDB:217050 


MENTAL RETARDATION, X-LINKED 20; 
MRX20 


MRX2 


GDB: 120 194 


RETARDATION, X-LINKED NONSPECIFIC, 
TYPE 2; MRX2 


MRX3 


GDB: 128 105 


GDP DISSOCIATION INHIBITOR 1; GDI1 
MENTAL RETARDATION, X-LINKED 


MRX40 


GDB:700754 


MENTAL RETARDATION, X-LINKED, WITH 
HYPOTONIA 

XX 1 JTW X V-fXNJLTTL 


MRXA 


GDB-9954641 


NONSPECIFIC, WITH APHASIA; MRXA 


MSD 


GDB: 11 9399 


SYNDROME 


MTM1 


GDB:1 19439 


MYOTUBULAR MYOPATHY 1 ; MTM1 


MYCL2 

1VX X V- '-L/iO 


GDR- 120209 


MYCL2 


MYP1 


VJX-/JLJ. 1Z- / / O J 


MYOPTA V-T TNTICFTV 1W1YP1 

IVl I Ui In, -rV"-i-/HN iVx-*!-/ , 1V1 Ixl 


TsTDP 




MOP RTF T"iK!FAQP» XTTYP 


lNxxO 




PATAPAPT HtTNTTAT Q VTvTTM? OA /TP 


NPHL1 


GDB:433705 


NEPHROLITHIASIS, X-LINKED RECESSIVE, 

WTTTJ PpMAr FAIT TTPF* YRTSJ 


NR0B1 


GDB- 1 1 8982 


ADRFNAL HYPOPLASIA PONOFNITAT • 
AHC 


NSX 


GDB:125596 


SYNDROME; NSX 


NYS1 


GDB:1 19458 


NYSTAGMUS, X-LINKED; NYS 


NYX ; 


GDB: 119814 


NIGHTBLINDNESS, CONGENITAL 
STATIONARY, WITH MYOPIA; CSNB1 


OA1 


GDB: 119459 


ALBINISM, OCULAR, TYPE 1; OA1 
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1 


OASD I 


GDB: 138457 

1 


SENSORINEURAL DEAFNESS; OASD 


5 1 


OCRL 1 


GDB: 119461 1 


LOWE OCULOCEREBRORENAL 
SYNDROME; OCRL 




ODT1 


GDB: 125360 


TEETH, ABSENCE OF 




OFD1 


GDB: 120248 ^ 


OROFACIODlvjrll AL o iJNDK.ulvic i, urui 




OPA2 1 


GDB: 125358 1 


OPTIC ATROPHY 2; OPA2 


10 


OPD1 J 


GDB: 120249 J 


OTOPALATODIGITAL SYNDROME 




OPEM 1 


GDB: 119467 1 


OPHTHALMOPLEGIA EXTERNAL, AND 




OPN1LW 


GDB: 120724 


rnr orrt TNDNESS PARTIAL, PROT AN 
SERIES; CBP 


15 


OPN1MW 


GDB .120622 : 


mi r»Rwr TNTCNTESS PARTIAL, DEUTAN 
SERIES; CBD; DCB 




OTC 


GDB: 1 19468 


ORNITHINE TRANSCARBAMYLASE 
DEFICIENCY, n i r oKAW11vivjine.ivu«. 
TO; OTC 


zu 


P3 


GDB.9954667 


^PROTEIN P3 




PDHA1 


1 GDB: 118895 


PYRUVATE DEHYDROGENASE COMPLEX, 
El-ALPHA POLYPEPTIDE-1; PDHA1 




| PDR 


| GDB:203409 


1 AMYLOIDOSIS, FAMILIAL CUTANEOUS 




|PFC 


| GDB: 120275 


1 PROPERDIN DEFICIENCY, X-LlNraiu 


25 


| PFKFB1 


|GDB:125375 


| 6-@PHOSPHOFRUCTO-2-KINASE; PFKFB1 




jpGKl 


| GDB: 120282 


PHOSPHOGLYCERATE KINASE 1; PGK1 




| PGK1P1 


j GDB: 120283 


mjncpwnm YPFRATE KINASE 1: PGK1 


30 




1 frOR- 128372 


DANDY- WALKER MALFORMATION WITH 
MENTAL RETARDATION, BASAL 
| GANGLIA DISEASE, 




1 PHEX 


1 GDB: 120520 


HYPOPHOSPHATEMIA, VITAMIN 
D-RESISTANT RICKETS; HYP 




1 PHKA1 


1 GDB: 120285 


PHOSPHORYLASE KINASE, ALPHA 1 
1 SUBUNIT (MUSCLE); PHKA1 


35 


| PHKA2 


| GDB:127279 


GLYCOGEN STORAGE DISEASE VIII 
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PHP 


GDB:119494 


PANHYPOPITUITARISM; PHP 


5 




UL/o.lJaljo 


CLASS A; PIGA 




PLPl 


GDB:120302 


PROTEOLIPID PROTEIN, MYELIN; PLP 




POF1 


GDB:120716 


PREMATURE OVARIAN FAILURE 1; POF1 




POLA 


GDB:120304 


POLYMERASE, DNA, ALPHA; POLA 


i r\ 


POU3F4 


GDB:351386 


DEAFNESS, CONDUCTIVE, WITH STAPES 
FDCATION 




PPMX 


GDB:9954669 


RETARDATION WITH PSYCHOSIS, 
PYRAMIDAL SIGNS, AND 
MACROORCHIDISM 


15 


PRD 


GDB:371323 


DYSPLASIA, PRIMARY 


PRPS1 


GDB: 1203 18 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-I- PRPS1 




PRPS2 


GDB:120320 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-H; PRPS2 


20 


PRS 


GDB:128368 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-2, WITH DYSMORPHISM 

A\m PlnPRPlP AT 

AINU L^JixsJiol\jf\Lf 




PpTQ 


\JUtj . 1 Z o j O / 


RETARDATION SYNDROME; PRTS 




PSF2 


GDB.l 19519 


TRANSPORTER 2, ABC; TAP2 


25 


RENBP 


GDB: 133792 


RENIN-BINDENG PROTEIN; RENBP 




"DTJXTQ 1 




RENPENNING TYPE 




KrZ 




KJb I UN 1 1 lo rlLriVlriiN 1 UoA-Z, KJrZ 




xvro 


VJJ-/D. 1 ZjjO 1 


PTf^A/fPXTTOQ A -A • PPA 
xlVJIVJLCIN IVJor\.-V), tvTO 


30 


RPGR 


GDB: 118736 


RETINITIS PIGMENTOSA-3; RP3 




RPS4X 


GDB:128115 


RIBOSOMAL PROTEIN S4, X-LINKED; 
RPS4X 




RPS6KA3 


GDB:365648 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POLYPEPTIDE 3; RPS6KA3 


35 


RSI 


GDB:1 19581 


RETINOSCHISIS; RS 
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Sll 


GDB: 120361 


ANTIGEN, X-LINKED, SECOND; SAX2 


5 


SDYS 


GDB: 119590 


GLYPICAN-3; GPC3 SIMPSON 
DYSMORPHIA SYNDROME; SDYS 




SEDL 


GDB 120372 


SPONDYLOEPIPHYSEAL DYSPLASIA 
LATE; SEDL 




SERP1NA7 


GDB: 120399 


THYROXINE-BINDING GLOBULIN OF 
SERUM; TBG 


10 


SH2D1A 


GDB: 120701 


IMMUNODEFICIENCY, X-LINKED 
PROGRESSIVE COMBINED VARIABLE 




SHFM2 


GDB:226635 


SPLIT-HAND/SPLIT-FOOT ANOMALY, 

A.- JLillN JSJCX/ 




SHOX 


GDB:6 118451 


SHORT STATURE; SS 


15 


SLC25A5 


GDB: 125 190 


ADENINE NUCLEOTIDE TRANSLOCATOR 








LETHAL INFANTILE 




SRPX 


GDB:3811398 


RETINITIS PIGMENTOSA-3; RP3 


20 


oKo 




TV/fPMTAT PPTAPnATTHXT Y-T TKTtfVlTl 
IVJLC/IN I J\Xa i\X2j 1 J\s\XJjt\ 1 ivy IN 5 Ar JLilN SSS^Uy 

SNYDER-ROBINSON TYPE 




olo 








O Y IN I 


uL/D . 1 1 y ouo 


C VTVT A "DQTM T« CVM1 
o I i\ AJrollN 1, o x IN 1 




SYP 


GDB: 125295 


SYNAPTOPHYSIN; SYP 


25 


TAF1 


GDB- 120573 


TATA BOX BINDING PROTEIN 
(TBP)-ASSOCIATED FACTOR 2A; TAF2A 




TAZ 


GDB: 120609 


CARDIOMYOPATHY, DILATED 3A; CMD3A 
ENDOCARDIAL FIBROELASTOSIS-2- EFE2 




TBX22 


GDB- 10796448 


CLEFT PALATE, X-LINKED' CPX 


30 


TDD 


GDB:1 19610 


MALE PSEUDOHERMAPHRODITISM: 
DEFICIENCY OF TESTICULAR 
17,20-DESMOLASE; 




TFE3 


GDB: 125870 


TRANSCRIPTION FACTOR FOR 
IMMUNOGLOBULIN HEAVY-CHAIN 
ENHANCER-3; TFE3 


35 


THAS 


GDB:128158 


THORACOABDOMINAL SYNDROME; TAS 
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THC 


GDB:125361 


THROMBOCYTOPENIA, X-LMKED; THC; 
XLT 


TIMM8A 


GDB: 119090 


DEAFNESS 1, PROGRESSIVE; DFN1 


TIMP1 


GDB: 119615 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-1; TIMP1 


TKCR 


GDB: 119616 


TORTICOLLIS, KELOIDS, 
CRYPTORCHIDISM, AND RENAL 
DYSPLASIA- TKC 


TNFSF5 


GDB: 120632 


IMMUNODEFICIENCY WITH INCREASED 
IeM 


UBE1 


GDB: 118954 


UBIQUITIN-ACTTVATING ENZYME 1; UBE1 


UBE2A 


GDB:131647 


UBIQUITIN-CONJUGATING ENZYME E2A; 
UBE2A 


WAS 


GDB: 120736 


WISKOTT-ALDRICH SYNDROME; WAS 


WSN 


GDB:125864 


PARKINSONISM, EARLY-ONSET, WITH 
MENTAL RETARDATION 


WTS 


GDB: 128373 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-6, WITH GYNECOMASTIA 
AND OBESITY- 


WWS 


GDB: 120497 


WIEACKER SYNDROME 


XIC 


GDB: 120498 


X-INACTIVATION-SPECIFIC TRANSCRIPT; 
XIST 


XIST 


GDB- 126428 


XIST 


XK 


GDB: 120499 


Xk LOCUS 


XM 


GDB: 119634 


XM SYSTEM 


XS 


GDB: 119636 


LUTHERAN SUPPRESSOR, X-LINKED; XS; 
LUXS 


ZFX 


GDB: 120502 


ZINC FINGER PROTEIN, X-LINKED; ZFX 


ZIC3 


GDB:249141 


HETEROTAXY, X-LINKED VISCERAL; 
HTX1 


ZNF261 


GDB:9785766 


MENTAL RETARDATION, X-LINKED; 
DXS6673E 


ZNF41 


GDB: 125865 


ZINC FINGER PROTEIN-41; ZNF41 
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ZNF6 


GDB:120508 


ZINC FINGER PROTEIN-6; ZNF6 



Table 25: Genes, Locations and Genetic Disorders on Chromosome Y 



Gene 


GDB Accession ID 


OMIM Link 


AMELY 


GDB: 119676 


AMELOGENIN, Y-CHROMOSOMAL; AMELY 


ASSP6 


GDB: 119020 


CITRULLINEMIA 


AZF1 


GDB: 119027 


AZOOSPERMIA FACTOR 1; AZF1 


AZF2 


GDB:456131 


AZOOSPERMIA FACTOR 2; AZF2 


DAZ 


GDB:635890 


DELETED IN AZOOSPERMIA; DAZ 


GCY 


GDB: 119267 


CONTROL, Y-CHROMOSOME INFLUENCED; 
GCY 


RPS4Y 


GDB: 128052 


RIBOSOMAL PROTEIN S4, Y-LINKED; RPS4Y 


SMCY 


GDB:5875390 


HISTOCOMPATIBILITY Y ANTIGEN; HY; 
HYA 


SRY i 


GDB: 125556 


SEX-DETERMINING REGION Y; SRY 


ZFY 


GDB: 120503 


ZINC FINGER PROTEIN, Y-LINKED; ZFY 



Table 26: Genes, Locations and Genetic Disorders in Unknown or Multiple 

Locations 



Gene 


GDB Accession ID 


OMIM Link 


ABAT 


GDB:581658 


GAMMA-AMINOBUTYRATE TRANSAMINASE 


AEZ 


GDB: 128360 


ACRODERMATITIS ENTEROPATHICA, 
ZINC-DEFICIENCY TYPE; AEZ 


AFA 


GDB:265277 


FILIFORME ADNATUM AND CLEFT PALATE 


AFD1 


GDB:265292 


DYSOSTOSIS, TREACHER COLLINS TYPE, 
WITHLIMB ANOMALIES 


AGS1 


GDB:10795417 


ENCEPHALOPATHY, FAMILIAL INFANTILE, 
WITH CALCIFICATION OF BASAL GANGLIA 


ASAH 


GDB.-6837715 


FARBER LIPOGRANULOMATOS1S 


ASD1 


GDB:6276019 


ATRIAL SEPTAL DEFECT; ASD 
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Gene 


GDB Accession ID 


OMIM Link 


5 


ASMT 


GDB:I36259 


CETYLSEROTONIN METHYLTRANSFERASE; 
ASMT ACETYLSEROTONIN 
METHYLTRANSFERASE, Y-CHROMOSOMAL; 
ASMTY; HIOMTY 




BCH 


GDB: 118758 


CHOREA, HEREDITARY BENIGN; BCH 




CCAT 


GDB: 118738 


CATARACT, CONGENITAL OR JUVENILE 




CECR9 


GDB-.10796163 


CAT EYE SYNDROME; CES 


10 


CEPA 


GDB:581848 


CONTROL, CONGENITAL FAILURE OF 




CHED2 


GDB:9957389 


CORNEAL DYSTROPHY, CONGENITAL 
HEREDITARY 




CLAI 


GDB: 119781 


CEREBELLOPARENCHYMAL DISORDER III 


15 


CLA3 


GDB.128453 


CEREBELLOPARENCHYMAL DISORDER I; CPD I 


CLN4 


GDB: 125229 


CEROID-LIPOFUSCINOSIS, NEURONAL 4; CLN4 




CPO 


GDB: 119070 \ 


COPROPORPHYRIA 


20 


CSF2RA 


GDB:1 18777 


COLONY STIMULATING FACTOR 2 RECEPTOR, 
ALPHA; CSF2RA 
GRANULOC YTE-MACROPH A OF 
COLONY-STIMULATING FACTOR RECEPTOR, 
ALPHA SUBUNIT, 




CTS1 


GDB: 118779 


CARPAL TUNNEL SYNDROME; CTS; CTS1 




DF 


GDB 132645 


FACTOR D 

X rvV/ X WAV JLS 


ZD 


DIH1 


GDB:439243 


DIAPHRAGMATIC 


DWS 


GDB:128371 


SYNDROME; DWS 




DYT2 


GDB:1 18788 


DYSTONIA MUSCULORUM DEFORMANS 2; 
nYT? 




DYT4 


GDB:433751 


DYSTONIA MUSCULORUM DFFORMANS 4- 
DYT4 


30 


EBR3 


GDB:1 18739 


EPIDERMOLYSIS BULLOSA DYSTROPHICA 
NEUROTROPHICA 




ECT 


GDB: 128640 


CENTRALOPATHIC EPILEPSY 




EEF1A1L14 


GDB: 1327 185 


PROSTATIC CARCINOMA ONCOGENE PTI-1 


35 


EYCL2 


GDB:4642815 


EYE COLOR-3; EYCL3 
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Gene 


GDB Accession ID 


OMIM Link 




FA1 


GDB:118795 


FANCONI ANEMIA, COMPLEMENTATION 
GROUP A- FACA 


5 


FANCB 


GDB:9864269 


FANCONI PANCYTOPENIA. TYPE 2 




GCSH 


GDB: 126842 


HYPERGLYCINEMIA ISOLATED NONKETOTIC 
TYPE ffl; NKH3 




GCSL 


GDB: 132139 


ISOLATED NONKETOTIC, TYPE IV; NKH4 


10 


GDF5 


GDB.-433948 


CARTILAGE-DERIVED MORPHOGENETIC 
PROTEIN 1 




GEP 


GDB: 119985 


GASTRIC INHIBITORY POLYPEPTIDE; GIP 




GTS 


GDB: 118807 


GILLES DE LA TOURETTE SYNDROME- GTS 




HHG 


GDB: 118740 


HYPERGONADOTROPIC HYPOGONADISM- HHG 


15 


HMI 


GDB:265275 


OF ITO* HMI 

V^X X X W j XXlTXX 




HOAC 


GDB:1 18812 


DEAFNESS, CONGENITAL, AUTOSOMAL 
RECESSIVE 




HOKPP2 


GDB:595535 


HYPOKALEMIC PERIODIC PxARALYSIS TYPE IT" 

X X X X X_TX"VX VUJ^XTIXV/ X JUlViv/JL/lV/ X ZxlVrXJU A UXU} X X X x~« Ax j 

HOKPP2 


20 


HRPT1 


GDB: 125252 


HYPERPARATHYROIDISM, FAMILIAL PRIMARY 




HSD3B3 


GDB 676973 


GIANT CELL HEPATITIS NEONATAL 

xjirvii X ^» m-jJutJij Ilia xxliiiU) IsJuvlirVl/VL/ 




HTC1 


GDB.265286 


HYPERTRICHOSIS UNIVERSALIS CONGENITA, 
AMBRAS TYPE; HTC1 




HV1S 


GDB:9955009 


HERPES VIRUS SENSnTVITY; HV1S 


25 


ICRl 


GDB: 127785 


LAMELLAR, AUTOSOMAL DOMINANT FORM 




ICRS 


GDB: 127789 


ICHTHYOSIS CONGENITA, HARLEQUIN FETUS 
TYPF 


30 


IL3RA 


GDB: 128985 


INTERLEUKIN-3 RECEPTOR, ALPHA; IL3RA 
INTERLEUKIN-3 RECEPTOR, 
Y-CHROMOSOMAL; IL3RA 


KAL2 


GDB:265288 


KALLMANN SYNDROME 2; KAL2 




KMS 


GDB: 118827 


SYNDROME; KMS 




KRT18 


GDB: 120127 ] 


KERATIN 18;KRT18 


35 


KSS 


GDB:9957718 


KEARNS-SAYRE SYNDROME; KSS 
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Gene 


GDB Accession ID 


OMIM Link 


5 


LCAT 


GDB: 119359 


FISH-EYE DISEASE; FED 

T FPTTVrTM-r^T-IOT T7QTT7POT A PVT TD AMOUCD a CD 
J^EA—l l rLLLN .^nwLJCrO 1 JtilvL/lj AC YL1 IvAIN or jbrvAoxi 

DEFICIENCY 




LIMM 


GDB:9958!61 


MYOPATHY, MITOCHONDRIAL, LETHAL 
INFANTILE; LIMM 




MANBB 


GDB: 125262 


MANNOSIDOSIS, BETA; MANB1 




MCPH2 


vil/JJ»70UJVJ J 


lVJJL^ rvvJ \ZiJT rl/vL, 1 , 1VIC I 


10 


MEB 




nTQP A QR 
JL7l»3lZf/\oCi 




MELAS 


\JiyJJ.77JJOJJ 


A/TCT AC CVMT»]?nA/fl7 
lVUDL,/\o a I IN U i\\J iVl ti 




MIC2 


GDB- 1201 84 


SURFACE ANTIGEN, Y-CHROMOSOMAL; MIC2Y 


15 


MPFD 


GDR-41Q179 


r^OXrr»lTMTT AT TI/TTXI T7TOT7"D TVDC 

DISPROPORTION 




MS 


GDB.-229116 


SCLEROSIS; MS 




MSS 


GDB: 11 8743 


MARINESCO-SJOGREN SYNDROME; MSS 




MTATP6 


GDB: 11 8897 


ATP SYNTHASE 6; MTATP6 


20 


MTCOl 


GDB: 11 8900 


COMPLEX IV, CYTOCHROME c OXIDASE 
SUBUNIT I; MTCOl; COI 






GDB: 11 8902 


CYTOCHROME c OXIDASE III; MTC03 




MIC YJB 


GDB: 11 8906 


COMPLEX III, CYTOCHROME b SUBUNIT 




MTND1 


GDB: 11 8911 


COMPLEX I, SUBUNIT ND1 ; MTND1 


25 


MTND2 


GDB:118912 


COMPLEX I, SUBUNIT ND2; MTND2 




MTND4 

1VJL J. IN J-/*? 




^UMrLJiX I, SUBUNll ND4; MTND4 




MTND5 


GDB: 11 891 6 


COMPLEX I, SUBUNIT ND5; MTND5 




MTND6 


GDB.118917 


COMPLEX I, SUBUNIT ND6; MTND6 


30 


MTRNR1 


GDB: 11 8920 


RIBOSOMAL RNA, MITOCHONDRIAL, 12S; 
MTRNR1 




MTRNR2 


GDB: 11 8921 


RIBOSOMAL RNA, MITOCHONDRIAL, 16S; 
MTRNR2 


35 


MTTE 


GDB: 118926 


TRANSFER RNA, MITOCHONDRIAL, GLUTAMIC 
ACID; MTTE 
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Gene 


GDB Accession ID 


OMIM Link 


MTTG 


GDB: 118933 


TRANSFER RNA, MITOCHONDRIAL, GLYCINE: 
MTTG 


MTTI 


GDB: 118935 


TRANSFER RNA, MITOCHONDRIAL, 
ISOLEUCINE; MTTI 


MTTK 


GDB: 118936 


MERRF SYNDROME TRANSFER RNA, 
MITOCHONDRIAL, LYSINE; MTTK 


MTTL1 


GDB: 118937 


MERRF SYNDROME TRANSFER RNA, 
MITOCHONDRIAL, LEUCINE, 1; MTTL1 


MTTL2 


GDB:1 18938 


TRANSFER RNA, MITOCHONDRIAL, LEUCINE, 
2; MTTL2 


MTTN 


GDB: 118940 


TRANSFER RNA, MITOCHONDRIAL, 
ASPARAGINE; MTTN 


MTTP 


GDB: 118941 


TRANSFER RNA, MITOCHONDRIAL, PROLINE; 
MTTP 


MTTS1 


GDB: 118944 


TRANSFER RNA MITOCHONDRIA! SFRTNF 1- 
MTTS1 


NAMSD 


GDB:681237 


NEUROPATHY MOTOR-SENSORY TYPF IT 
WITH DEAFNESS AND MENTAL RETARDATION 


NODAL 


GDB:9848762 


NODAL, MOUSE, HOMOLOG OF 


OCD1 


GDB: 118846 


DISORDER- 1; OCD1 


0PD2 


GDB: 131394 


SYNDROME 


PCK2 


GDB: 137 198 


PHOSPHOENOT PYRTIVATF PARRfYVWrrvT A QT7 o 
MITOCHONDRIAL; PCK2 


PCLD 


GDB:433949 


POLYCYSTIC LIVER DISEASE; PLD 


PCOSl 


GDB: 139 1802 


STEIN-LEVENTHAL SYNDROME 


PFKM 


GDB: 120277 


GLYCOGEN STOR A OF T1TSFA<!F VTT 


PKD3 


GDB: 127866 


KIDNEY DISEASE 3, AUTOSOMAL DOMINANT- 
PKD3 


PRCA1 


GDB:342066 


PROSTATE CANCER; PRCA1 


PROl 


GDB: 128585 




PROP1 


GDB:9834318 


PROPHET OF PIT1, MOUSE, HOMOLOG OF; 
PROP1 


RBS 


GDB: 118862 


ROBERTS SYNDROME; RBS 
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<& 0:3 -one is* a- 04 a o^e fi *oe 



10 



Gene 


SDB Accession ID ' 


OMIMLink 


RFXAP ( 


3DB:9475355 


„_„ tTr ATADvcArrnR V- ASSOCIATED 
PROTEIN; RFXAP 


RP 


GDB .9958158 


RETINITIS PIGMENTOSA-8 


SLC25A6 


GDB-.125184 


ADENINE NUCLEOTIDE TRANSLOCATOR 3; 
ANT3 ADENINE NUCLEOTIDE rKANMAJ^Aivnv 
3 Y-CHROMOSOMAL; ANT3Y 




GDB-250333 


SPASTIC PARAPLEGIA-5B, AUTOSOMAL 
RECESSIVE; SPG5B 


STO 


GDB.439375 


r«CDDDD AT f^TO AMTTSM 


suox 


GDB-.5584405 


SULFOCYSTEINURIA 


TC21 


GPB:5573831 


ONCOGENE TC21 


THM 


GDB:439378 


FAMILIAL . . 


TST 


GDB: 134043 


RHODANESE; RDS 


TTD 


GDB:230276 


TRICHOTHIODYSTROPHY; TTD 



15 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
20 to those described will become apparent to those skilled in the art from the foregoing 
description and accompanying figures. Such modifications are intended to fall within the 

scope of the appended claims. 

Various publications are cited herein, the disclosures of which are 

incorporated by reference in their entireties. 

25 
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The invention can be illustrated by the following embodiments enumerated 
in the numbered paragraphs that follow: 



1 . A method for identifying a test compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, comprising the steps of (a) 
contacting a detectably labeled target RNA molecule with a library of test compounds under 
conditions that permit direct binding of the labeled target RNA to a member of the library of 
test compounds so that a detectably labeled target RNArtest compound complex is formed; 
" (b) separating the detectably labeled target RNA:test compound complex formed in step (a) 
from uncomplexed target RNA molecules and test compounds; and (c) determining a 
structure of the test compound bound to the RNA in the RNArtest compound complex. 



2* The method of paragraph 1 in which the target RNA molecule 
j 5 contains regions of 28S rRNA or analogs thereof 

3. The method of paragraph 1 in which the detectably labeled RNA is 
labeled with a fluorescent dye, phosphorescent dye, ultraviolet dye, infrared dye, visible 
dye, radiolabel, enzyme, spectroscopic colorimetric label, affinity tag, or nanoparticle. 

20 

4. The method of paragraph 1 in which the test compound is selected 
from a combinatorial library comprising peptoids; random bio-oligomers; diversomers such 
as hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; nonpeptidal 
peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid libraries; 

2^ antibody libraries; carbohydrate libraries; and small organic molecule libraries, including 
but not limited to, libraries of benzodiazepines, isoprenoids, thiazolidinones, 
metathiazanones, pyrrolidines, morpholino compounds, or diazepindiones. 



5. The method of paragraph 1 in which screening a library of test 
compounds comprises contacting the test compound with the target nucleic acid in the 
presence of an aqueous solution, the aqueous solution comprising a buffer and a 
combination of salts, preferably approximating or mimicking physiologic conditions. 



6. The method of paragraph 5 in which the aqueous solution optionally 
further comprises non-specific nucleic acids comprising DNA, yeast tRNA, salmon sperm 
DNA, homoribopolymers, and nonspecific RNAs. 
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7. The method of paragraph 5 in which the aqueous solution further 
comprises a buffer, a combination of salts, and optionally, a detergent or a surfactant In 
another embodiment, the aqueous solution further comprises a combination of salts, from 
about 0 mM to about 100 mM KC1, from about 0 mM to about 1 M NaCl, and from about 0 
mM to about 200 mM MgCl 2 . In a preferred embodiment, the combination of salts is about 
1 00 mM KC1, 500 mM NaCl, and 10 mM MgCl 2 . In another embodiment, the solution 
optionally comprises from about 0.01% to about 0.5% (w/v) of a detergent or a surfactant 

j 8. Any method that detects an altered physical property of a target 

nucleic acid complexed to a test compound from the unbound target nucleic acid may be 
used for separation of the complexed and non-complexed target nucleic acids in the method 
of paragraph 1. In a preferred embodiment, electrophoresis is used for separation of the 
complexed and non-complexed target nucleic acids. In a preferred embodiment, the 

^ electrophoresis is capillary electrophoresis. In other embodiments, fluorescence 

spectroscopy, surface plasmon resonance, mass spectrometry, scintillation, proximity assay, 
structure-activity relationships ("SAR") by NMR spectroscopy, size exclusion 
chromatography, affinity chromatography, and nanoparticle aggregation are used for the 
separation of the complexed and non-complexed target nucleic acids. 

20 

9. The structure of the test compound of the RNArtest compound 
complex of paragraph 1 is determined, in part, by the type of library of test compounds. In a 
preferred embodiment wherein the combinatorial libraries are small organic molecule 
libraries, mass spectroscopy, NMR, or vibration spectroscopy are used to determine the 
structure of the test compounds. 



30 



35 
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WHAT IS CLAIMED IS: 

1 . A method for identifying a test compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, comprising the steps of: 

(a) contacting a detectably labeled target RNA molecule with a 
library of test compounds under conditions that permit direct 
binding of the labeled target RNA to a member of the library 
of test compounds so that a detectably labeled target RN Attest 
compound complex is formed; 

(b) separating the detectably labeled target RNA:test compound 
complex formed in step(a) from uncomplexed target RNA 
molecules and test compounds by capillary gel 
electrophoresis; and 

(c) determining a structure of the test compound bound to the 
RNA in the RNA:test compound complex by mass 
spectroscopy. 
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ABSTRACT 

The present invention relates to a method for screening and identifying test 



compounds that modulate premature translation termination and/or nonsense-mediated 
messenger ribonucleic acid ("mRNA") by interacting with a preselected target ribonucleic 
acid ("RNA"). In particular, the present invention relates to identifying test compounds that 
bind to regions of the 28S ribosomal RNA ("rRNA") and analogs thereof. Direct, non- 
competitive binding assays are advantageously used to screen libraries of compounds for 
those that selectively bind to a preselected target RNA. Binding of target RNA molecules to 
a particular test compound is detected using any physical method that measures the altered 
physical property of the target RNA bound to a test compound. The structure of the test 
compound attached to the labeled RNA is also determined. The methods used will depend, 
in part, on the nature of the library screened. The methods of the present invention provide 
a simple, sensitive assay for high-throughput screening of libraries of compounds to identify 
pharmaceutical leads. 



20 
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Figure 1 
Sheet 1/11 
Attorney Docket No. 10589-021 
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Figure 2 
Sheet 2/11 
Attorney Docket No. 10589-021 
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Figure 4 
Sheet 4/11 
Attorney Docket No. 10589-021 
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Figure 5 
Sheet 5/11 
Attorney Docket No. 10589-021 
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Figure 6 
Sheet 6/11 
Attorney Docket No. 10589-021 
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Figure 7 
Sheet 7/11 
Attorney Docket No. 10589-021 




7- DMSO 

8. Paromomycin 

9. Compound A (A, B, E. F, G, H) 

10. Compound B (A, B, E, F, G, H) 

11. Compound C (C, D) 

12. Compound D (A, B, E, F, G, H) 
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Figure 8 
Sheet 8/11 
Attorney Docket No. 10589-021 
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7. DMSO 

8. paromomycin 

9. Compound E (J) 

10. Compound F (J) 

11. Compound 6 (I) 

12. Compound H (J) 
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Figure 9 
Sheet 9/11 
Attorney Docket No. 10589-021 
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1. Compound I 1 

2. Compound H 

3. Compound G 

4. Compound F > KE 

5. Compound J 

6. Compound E 

7. 1%DMSO 



A 



^ ***** 



A. Compound K and Compound G 

B. Compound E 

C. Compound E 

D. Compound E 
(compare lane 6 to lane 7 
for A, B, C, D and lane 3 to 
lane 7 for D) 
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Figure 10 
Sheet 10/11 
Attorney Docket No. 10589-021 




comparo lanes 3. 4, and 6 to lane 1 and 5. Compound G 11 . Compound G 

comparo lanes 9, 10, and 11 to Jane 7 6. Compound H 12. Compound H 
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Figure 11 
Sheet 11/11 
Attorney Docket No. 10589-021 



Competition assay: UGA luclforaio RNA and cold Domalnll RNA 
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